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PROBABILITY THAT TWO DIFFUSING MOLECULES 
WILL COLLIDE 


RoBERT A. WIJSMAN* 
DEPARTMENT OF PuHysics, UNIVERSITY OF CALIFORNIA 
- BERKELEY, CALIFORNIA 


A formula is derived for the probability P(r1) that two molecules, originally ata 
distance 7; from each other and moving in an infinite scattering and absorbing 
medium, will eventually collide with each other. 


The following problem, although physical in its statement, is important 
in its biological applications. Two molecules diffuse in an infinite scatter- 
ing and absorbing medium; they will chemically react with each other if 
they ever happen to meet. Given the original separation, the problem is to 
find the probability that this reaction will have taken place after an in- 
finite time has elapsed. This problem arises in the treatment of the ‘‘dif- 
fusion model” for the action of radiation on the genetic material of single 
cells. In this model the genetic damage is effected by ionization products, 
formed by the radiation anywhere in the cell and diffusing toward the 
genes, while being subject to the possibility of removal by undergoing 
chemical reactions with other cell constituents. Clearly, the probability 
that a certain gene will be damaged by such a diffusing ionization product 
constitutes a particular case of the problem stated above. Another ap- 
plication is found in the H,O, production in a liquid medium under the 
influence of radiation, where one H,O, molecule is formed by two OH 
radicals which approach each other by diffusion. 

For the moment, we shall assume that the two molecules will react 
chemically at their first collision. The extension to the more general case, 
in which the reaction probability per collision is less than one, will be indi- 
cated later. 

For simplicity, we shall assume that the two molecules are spherical in 
shape. The problem can be simplified, without loss of generality, by mak- 
ing one molecule stationary and letting only the other molecule diffuse, 
by assigning the latter the proper mean velocity and mean free path. 

* Public Health Service Research Fellow of the National Cancer Institute. 
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Furthermore, we can consider the diffusing molecule as a point particle if 
we take the radius of the stationary molecule equal to the sum of radii 
of the original molecules. 

The stationary molecule, with radius 7, can be considered as a “‘black”’ 
sphere, i.e., a sphere which completely absorbs an incoming point particle. 
This sphere is surrounded by an infinite scattering and absorbing medium. 
If the point particle is initially at a distance 7; from the center of the 
black sphere, the problem is to find the probability P(r) that it will be 
caught by the sphere eventually. The answer to this problem is the same 
as the answer to the following spherically symmetric and time-inde- 
pendent problem. Given a source of one point particle per unit time dis- 
tributed homogeneously over a spherical surface with radius r; concentric 
with the black sphere, how can we calculate the number of point particles 
absorbed by the sphere per unit time?* Thus formulated, the problem 
could apply to the diffusion of thermal neutrons in a scattering and 
absorbing medium. 

As in neutron diffusion problems, we ignore the velocity distribution of 
the diffusing particles, and pretend that the particles move with a 
velocity v, which is constant in magnitude. Also, we assume that the 
scattering is isotropic. Let the mean free path for scattering be /, and let 
the probability per unit time that a particle will be absorbed by the 
medium be a. Each particle is characterized by its distance ¢ to the center 
of the black sphere, and by the angle @ between its velocity v and the 
radius vector 7. Instead of 6, we shall use wu, where uw = cos 6. We now in- 
troduce the function ¥(7, u), so that 4ar’°)(r, u) dr du is the number of 
particles between r and r + dr with velocity directions between wu and 
+ dy. For y we can derive the Boltzmann transport equation 


ay 1— 29 1 1 
08 Fete tabs the g(r—n), (1) 


2 
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in which 
1 
vo (7) =f vu) du 


equals particle density at r, and 6 is the Dirac 6 function. In equation (1) 
the first two terms represent the loss of particles from interval drdu asa re- 
sult of their motion, the third term is the loss from collisions in drdu to other 
intervals, the fourth term is the loss through absorption in the medium, 
id The writer is indebted to Dr. R. Serber for indicating the possibility of reduction to a 
time-independent problem. 
} See, e.g., S. Chandrasekhar (1950). 
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the fifth is the gain from collisions in other intervals to drdu, and the right- 
hand side represents the spherical source at r = 7;. The calculations are 
simplified somewhat by introducing the dimensionless quantities 


UG r 1 


T 
yi LST Pie A= a, e(p, wu) = Wp (r, Ds (2) 


v 
With the substitutions (2), equation (1) is changed into 
1 


dy 
Me yi 


See 1 
Es pipes hie ca ge pi) (3) 


There are two boundary conditions. At p = po the requirement is that 
there are no particles coming out of the black sphere, and at p — ~ the 
particle density should approach zero: 


¢(p,4) =90, (u>0), (4) 
jum ¢ (p, uw) =0. (5) 


The probability P(r,) is obtained from the solution to equations (3), (4), 
and (5) as the total current entering the black sphere 


0 
P(r) = —4er30 f W(r, nuda, 
and with the substitutions (2) we have: 
0 
P(n) =— 403 f o(m u) udu. (6) 


Equations (3), (4), and (5) cannot be solved exactly, but R. E. Marshak 
(1947) has proposed a convenient approximate method, called the 
“Spherical Harmonic Method,” which we shall use in our problem. The 
method consists of expanding ¢(p, u) in a series of Legendre polynomials 
P,(u): 

o(p,#) = >> (1+ 4) ¢1(9) Pil), 
l 
and breaking the series off after a finite number of terms. We shall only 
keep the first two terms 


o(p, Hu) =300(p) +3ei(p)4, (7) 


calling this the ‘‘Py-approximation.” The validity of this approximation 
will be discussed later. After substituting (7) into (3), applying the 


operations 
Zt Fi d i Pie 
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respectively on both sides of the equation, we obtain two equations for 


go and G1: 
dg, $1 1 
= = 8 
MONT gate p 4np 5(p— pr), (8) 
d 
(14+) +4 fee (9) 


In the Spherical Harmonic Method it is impossible to satisfy boundary 
condition (4) exactly. In the P,-approximation, for instance, (4) would 
read: ~0(po) = 0, ¢1(p0) = 0, but in order to find solutions of (8) and (9) 
we can allow only one boundary condition at p 9. With Marshak, we choose 
for this boundary condition the requirement that the total current leaving 
the black sphere be zero. Thus we replace (4) by 


bY 
J ee Dydu 0K (10) 


Substitution of (7) into (10) and (5) gives the boundary conditions 


390 (po) +¢1(p) =0, (11) 
lim go(o) =0, lim e(p) =0. (12) 


After elimination of ¢, we can solve for ¢» with the following result: 


poo (p) = ae aVSe se —V3X(1+A) |p—p, | 


WAS (dete See (13) 


+ Po eV ites 200) b 
V3Xd) $3140) = 
0 


If we substitute (7) into (6) and keep in mind (11), we obtain P(7:) = 
2m p,Po(po). Evaluating (13) at p = po we finally obtain: 


Z 
Fn) (14-3 Vi a ae 
It is seen from (2) that ) is the ratio of mean free path for scattering to 
mean free path for absorption. Since in most of the interesting cases the 
number of absorbing molecules is small compared to the number of 
scattering molecules, we may assume \ < 1. If we neglect \ compared to 
unity in equation (14), change back to the original variables by means of 
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(2), and introduce D = 4v/, which has the dimension of a diffusion con- 
stant, the final result is 


a | 
P(r) =(1+5> Sex ae (15) 
The dependence of P(r,) on 7; is two-fold: P(7;) is inversely proportional 
to r, as a result of diffusion alone, and proportional to an exponential in 
r, as a combined result of diffusion and absorption. 

If we want to calculate the probability that the two molecules will 
chemically react in case the reaction probability per collision is less than 
one—say, 1 — a—we have to replace the “‘black”’ sphere by a partially 
reflecting sphere. Equations (8) and (9) will still be valid, but the bound- 
ary condition (10) has to be replaced by the requirement that the current 
emerging from the sphere is a fraction a of the current entering the sphere: 


1 0 
i; (po, He) udp = —a f (po, u) udp. 
0 ok 


If the calculation is carried through, it is found that the result is given by 
(15), except that the factor 


(+34) 


(1455755) - 


has to be replaced by 


Validity of the approximation. If 1 «ro, equation (15) simplifies to 


P (173) =~ exp [-Vein-m J, 


a result which can be obtained in a much simpler way from the solution 
of a differential equation rather than the Boltzmann transport equation. 
The purpose of the method of solution used in this paper is to obtain the 
extra factor 

(+z 37, = To 


in case 1 is not much less than 7p. It still remains to be shown what the 
domain of validity of equation (15) is. Since the extra factor is inde- 
pendent of the absorption in the medium, we may take a = 0, which sim- 
plifies the calculation. After the P,-approximation is carried out, it turns 
out that the factor 

Gr 37, 2 ‘9 
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21 Vaan 
Ge ie 


0 


has to be replaced by 


It is clear then that the result of the P,-approximation is not valid for 
values of 1 = ro, but gives a reasonably good approximation if ] < ro. 

The author wishes to thank Dr. Herbert D. Landahl for valuable sug- 
gestions concerning the way of presentation. 
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THE LIMITING INFORMATION CAPACITY OF 
A NEURONAL LINK 


DonaLtp M. MacKay AnD WARREN S. McCuLiocu 
UNIVERSITY OF LONDON KING’s COLLEGE AND 
UNIVERSITY OF ILLINOIS COLLEGE OF MEDICINE 


The maximum rate at which a synaptic link could theoretically transmit 
information depends on the type of coding used. In a binary modulation system it 
depends chiefly on the relaxation time, and the limiting capacity equals the 
maximum attainable impulse rate. In a system using pulse-interval modulation, 
temporal precision may be a more important limiting factor. It is shown that in a 
number of typical cases a system of the second type could transmit several times 
more information per second through a synaptic link than a binary system, and 
the relation between relative efficiency, relaxation-time, and temporal resolving 
power is generalized in graphical form. It is concluded, not that interval modula- 
tion rather than binary modulation “‘ought’’ to be the mode of action of the 
central nervous system, but that the contrary assumption is unsupported by 
considerations of efficiency. 


I. Introduction. The way in which the nervous system transmits in- 
formation has long been the subject of debate. On the one hand, the all— 
or—none character of the nervous impulse and the demonstrated perform- 
ance of motoneurons as coincidence detectors (Lorente de Né, 1939) led 
naturally first to a model of the central nervous system (McCulloch, 1943; 
Pitts and McCulloch, 1947) in which information was represented in terms 
of binary digits, quantized with respect to time as in a serially operated 
digital computer. On the other hand, there has been a steady accumulation 
of other evidence which has been adduced in favor of models (Lashley, 
1942) employing frequency modulation of trains of impulses to represent 
the information transmitted. We refer here not only to the long known 
fact that sense organs deliver trains of impulses whose frequency after 
adaptation is roughly proportional to the logarithm of the intensity of 
stimulation, but also more particularly to recent observations (Brook- 
hart, Moruzzi and Snider, 1950; Gernandt, 1950) that the frequency of 
discharge of spontaneously active central neurons can be altered in re- 
sponse to signals afferent to them. 

In view of these last observations and others like them, it seems realistic 
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to consider how efficiently a typical neuronal link, or ‘“‘synapse,” could be 
used to convey information in this way, and particularly to compare its 
limiting efficiency with that which could be achieved if it were used on the 
digital basis of the first model. 

It is not to be expected that the proponents of either of these models 
believed them to be the whole story, nor is it our purpose in the following 
investigation to reopen the “analogical versus digital” question, which we 
believe to represent an unphysiological antithesis. The statistical nature 
of nervous activity must preclude anything approaching a realization in 
practice of the potential information capacity of either mechanism, and in 
our view the facts available are inadequate to justify detailed theorization 
at the present time. What does seem worth while at the moment is a 
sufficiently general discussion to determine upper limits of performance 
and so provide a quantitative background to lend perspective to the 
framing of hypotheses. 

II. Selective information content. A signal is a physical event that, to the 
receiver, was not bound to happen at the time or in the way it did. As 
such, we may think of it as one out of a number of possible alternative 
events, each differing perceptibly in some respect from all the other possi- 
bilities. A given signal can therefore be considered as an indicator, select- 
ing one out of a finite number of perceptibly distinct possibilities, and 
thus indicating something of the state of affairs at its point of origin. The 
greater the number of possible alternatives to a given signal, the greater 
the ‘‘amount of selective information’ we say it contains. The selective 
information—content of a signal is in fact defined as the logarithm (base 2) 
of the number of alternatives, where these are all equally likely a priori. 
This simply represents the number of steps in a search process among the 
alternatives carried out in the most efficient way—by successive sub- 
division of the total assembly into halves, quarters, and so forth. (We shall 
not consider the case of unequal prior probabilities.) 

Now a neuronal impulse carries information, in the sense that it indi- 
cates something of the state of affairs at its point of origin. It is able, more- 
over, on arriving at an axonal termination, to affect the probability of 
occurrence of an impulse in the axon of the cell with which it “synapses.” 
Thus, whatever else does or does not “‘cross a synapse,” it is safe to say 
that information does. The nature or even the existence of the synapse 
as a physical link is not here in question. For our purpose it will be suf- 
ficient to consider the observable time-relations which obtain between 
incoming and outgoing impulses in cases in which these are temporally 
coherent. These will enable us to estimate the upper limits to the informa- 
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tion capacity of such synaptic links—the maximum number of “bits” 
carried per second—and to compare the informational efficiencies of the 
different modes of operation (called modulation systems by the com- 
munication engineer) which are conceivably available to model makers 
ofthe. CNS. 

In particular we shall examine the suggestion that an all-or-none binary 
code using pulses separated by minimal quantized time intervals would 
be more efficient than a system using the same elements in which the 
varying time of occurrence of impulses represented the information to be 
conveyed. This does not appear to be the case. 

III. The selective information-content of an impulse. To determine the 
limiting information-content of a signal, we have to decide (a) which 
parameters are permissible variables, (b) how large a variation must be in 


FIGURE 1 


order to be perceptible, or rather to be statistically significant, and hence 
(c) how many significantly distinct values of each parameter are possible. 
We ought also to know (d) the relative probabilities of the different pos- 
sible combinations of parametric values, if these probabilities are unequal; 
but unless their differences are great the order of magnitude of the infor- 
mation content will not be affected. (The effect will always be a reduction 
of the average information—content per signal.) 

The limiting selective information-capacity of a signal-carrying element 
will then be the product of the average information-content per signal, 
with the maximum mean signal-frequency allowable for the modulation- 
system adopted. 

Typical time-relations which have been observed in so-called synaptic 
transmission are summarized in Figure 1. 

An afferent impulse which begins to rise at A may give rise to a spike 
(of about 1 msec. in duration) at any time C between B and D, 0.5 to 0.8 
msec. later. The majority of spikes, however, are observed to occur after 
an interval, the “synaptic delay,” Tp, with a standard deviation of less 
than 0.05 msec., and a typical mean value of the order of 0.65 msec. 


(Lloyd, 1946). 
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No matter how high the frequency of stimulation, successive spikes are 
not observed to occur within time intervals Ts of less than 1 msec. Fre- 
quencies of 500 per second are not long sustained, and a reasonable upper 
limit for a steady response to recurrent stimuli would perhaps be 250 per 
second. In central auditory nerve fibers for which unfortunately no con- 
stants of monosynaptic transmission are available to us, maximum fre- 
quencies, after adaptation, of 250 to 300 c/s have been observed (Galam- 
bos and Davis, 1943). There is thus a minimal “dead time” T, between 
impulses which ranges from 1 to 4 msec., according to the demands made 
on endurance. 

Now the amplitude of a nerve impulse is relatively slowly variable, and 
the total amount of information conveyable by such variation would be 
relatively small. For our purpose we shall assume that the amplitude can 
have only two values, namely, zero or unity on an ad hoc scale. Thus each 
impulse, simply by its presence or absence during the short period over 


FIGURE 2 


which it was expected, could provide one unit or “‘bit’’ of selective infor- 
mation. If, therefore, we had a system in which we “‘quantized”’ the time 
scale by dividing it into discrete intervals equal to the minimum necessary 
separation 7, between impulses, the system should on the above basis 
be able to convey 1/7, bits of information per second at most. Such a 
system we may call a time quantized binary system (McCulloch and 
Pitts, 1943). 

There is, however, another variable parameter of our neuronal signal. 
The time interval between successive impulses can vary, so that any given 
interval represents a choice out of a certain range of possible pulse posi- 
tions on the time axis. Let us denote by 7's the time interval between one 
pulse and its immediate predecessor (Fig. 2), and suppose that for reasons 
which we consider later 75 has a maximum permissible value Ty, and of 
course a minimum T,. The range of Ts, (Tm — Tp), may be thought 
of as subdivided into a finite number of equal intervals, of magnitude AT. 

These intervals must not be smaller than the average fluctuations in T, 
(Fig. 1) if changes in Ts are to be statistically significant, and we may 
conveniently assume at first that they are of the order of 0.05 msec. 
(Lorente de N6, loc. cit., p. 409). We thus think of our impulse as selecting, 
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by its occurrence at T's, one out of (Tu — T,)/AT possible positions on 
the time axis. Calling the ratio (Ti — Tz)/AT, n, we have here log, n 
bits of selective information per signal, in respect of its instant of occur- 
rence. 

Now for maximum efficiency our hypothetical system should on an 
average use all 2 possible values of 7’; equally frequently. The mean value 
of 7's will therefore at maximum efficiency be }(Tu + Tz). A “pulse in- 
terval modulation” system could evidently convey in this way a maxi- 
mum of 


2 
Screamers se loge nN 
bits of information per second. 

It is here that we find a criterion for the optimum value of Ty. Clear- 
ly if Tis allowed to increase indefinitely, the number of bits per impulse 
also increases indefinitely—but only roughly as the logarithm of Ty, 
whereas the number of impulses per second goes down roughly as the 
inverse of Ty, itself. The optimum value of Ty is that which maximizes 

Vy Tu al Ln 


PR RETe logs —— AT (1) 


If we take AT as our unit of time, writing Ty = mAT and T, = rAT, 


we have 


Zz 


This is maximum for a given r when 
m+r 
th 


m— 


=In(m—vr). (3) 


Taking a value of 1 msec. for Tz and 0.05 msec. for AT, we find by 
approximate solution of (3) that at Ty, = 2 msec. a maximum selective 
information capacity of about 2.9 bits per msec. could theoretically be 
be attained, as against 1 bit per msec. on the binary quantized system. 
Under these conditions the mean frequency would be about 670 impulses 
per second and the average information content about 4.3 bits per impulse. 

IV. First approximations to realism. Our discussion hitherto has been 
mainly academic. We have seen that a communication engineer could 
use an element with the properties idealized in Figure 1, nearly three 
times as efficiently in a pulse-interval modulation system as in a binary 
system at the maximum possible pulse frequency. We have favored the 
latter as much as possible by (a) granting a maximum frequency of 1000 
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per second; but (b) we have not yet asked whether the precision of timing 
between stimulus and outgoing impulse is matched by a corresponding 
precision of time-resolution in possible receiving centers; (c) we have not 
considered the possible increase in the variance of synaptic delay due to 
irregularity in the pulse sequence; and (d) we have assumed all values of 
the modulated pulse-interval to be equally likely, irrespective of the 
position of its predecessor. This last assumption is valid in an estimate of 
limiting capacity. 

In other words, our computation is probably not too unrealistic as an 
estimate of the maximum number of bits per msec., or even per 10 msec., 
which could be represented by the physical behavior of the signal. But for 
an estimate of the information effectively carried we must, as we saw 
above, consider the number of possibilities distinguishable by the receiver; 
and for an estimate of continuous information—capacity we must accept a 
lower value for the mean impulse-frequency. 

Information on points (b) and (c) is scanty. It is known (Lorente de 
Né, loc. cit., p. 422) that the period of latent addition of two converging 
impulses, over which their relative arrival-time may vary without ap- 
preciable effect, is of the order of 0.15 msec. This might seem to suggest 
that our scale unit of time AT should be of the same magnitude, or even 
twice this, since the relative delay may be of either sign. But the index of 
latent addition falls off very sharply over a few hundredths of a milli- 
second when the relative delay exceeds 0.15 msec. As a time resolving in- 
strument, therefore, such a summation mechanism can in principle de- 
tect coincidence to much finer limits (as may be seen by imagining a con- 
stant delay of 0.15 msec. to be introduced into one signal path), so that it 
is not obvious that the operational value of AT merits much increase. 

Not much more is known about the effects of irregular firing on synaptic 
delay. A range of 0.4 msec. is given by D. Lloyd (loc. cit.) as the difference 
between the delay measured for a relatively refractory neuron which had 
just fired, and that for a relatively excitable neuron receiving summating 
impulses. In the case we have considered above, which is relatively least 
favorable to interval modulation, the interval between successive pulses 
might vary from 1 msec. to 2 msec. and might reasonably be expected to 
affect synaptic delay by perhaps 0.1 to 0.2 msec. 

Let us suppose for a moment that in the worst case an imprecision 
AT of 0.3 msec. had to be accepted, and that we again tolerate a mini- 
mum pulse to pulse interval of 1 msec. The same calculation as before 
shows that at the optimum value of Ty, around 3.1 msec., an average in- 
formation capacity of some 1.4 bits per msec. would be attainable, so that 
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even under these extreme conditions a binary system would still be less 
efficient than the other by 40 per cent. 

We must, however, make a further step toward realism by estimating 
the capacities for a higher value of the refractory period Tp. Taking a fre- 
quency of 250 per second as reasonably attainable after adaptation 
(Galambos and Davis, Joc. cit.) we can set Tp = 4 msec. as perhaps a 
typical ‘‘adapted” value. We then find that when AT = 0.05 msec. the 
optimum value of Tx, is 6.6 msec. and the optimum mean frequency 190 
per second, yielding as much as 1090 bits per second or nearly 6 bits per 
impulse. (At the maximum possible rate of 250 per second the cor- 
responding capacity on a digital basis is of course only 250 bits per 


5 
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Ficure 3. Relative efficiency of interval and binary modulation 


second.) Even if AT must be as large as 0.3 msec., a capacity of 620 bits/ 
sec. can be attained at Ty, = 8.6 msec. and a mean frequency of 160 
per second—still 23 times the maximum capacity of the same element 
used in a binary system. Figure 3 shows in general how the ratio of the 
information capacity with interval modulation (Cr) to that with binary 
modulation (C,) varies with the ratio T,/AT. = 
Turning finally to consider normal as opposed to optimal conditions, 
we may take 50 impulses per second as a typical mean frequency of excita- 
tion of sensory fibers when transmitting information. Our calculation is 
now straightforward. A mean pulse to pulse interval of 20 msec. could 
correspond roughly to a minimum 7% of (let us say again) 4 msec., and a 
maximum of 36 msec.—a range of 32 msec. Each impulse then selects one 
out of 32/AT possible positions, so that if AT were taken as 0.05 msec., the 
selective information content of each impulse would be log: 640 or about 
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9.3 bits per impulse, giving an information capacity of just under 500 bits 
per second despite the low impulse repetition frequency. 

In general if a different value be taken for AT, say & times greater 
(Tw and T, remaining the same), the effect is to reduce the selective in- 
formation content per impulse by log, &. Thus an increase of AT to 0.2 
msec. allows only 7.3 bits per impulse; an increase to 0.4 msec., 6.3 bits, 
and so on. It should be noted, however, that these are figures for a fixed 
value of Ty and are not the maximum figures for each value of AT, 
which would be determined by optimizing Ty, as in the preceding sections. 

It is of course exceedingly unlikely in practice that successive impulse 
positions would be statistically uncorrelated in these cases where so large 
a percentage change is theoretically possible in successive pulse intervals. 
It should be noted moreover that we have considered only one synaptic 
link. If a number of these were connected in series, the precision of tim- 
ing would of course diminish, roughly as the square root of the number of 
links. Our qualitative conclusions should hold however for reasonably large 
multisynaptic links, since, as shown in Figure 3, the interval modulation 
system does not become inferior to the binary until AT exceeds 3Tp. 

V. Conclusions. Our discussions have here been concerned with the 
limiting capacity of a neuronal link as a transducer of information. We 
have seen that the observed time-relations between incoming and out- 
going impulses indicate a precision that would justify the use of pulse in- 
terval or “frequency” modulation rather than binary all-or-none ampli- 
tude modulation in a communication system using such a transducer. 

At the maximum pulse frequency normally attainable, the binary system 
would he inferior to the other by a factor of 2 or 3 at least. At the typical 
peripheral frequency of 50 per second, the theoretical factor in favor of 
interval modulation rises to 9 or more. Little significance is attached to 
the precise figures obtained, but they indicate the theoretical possibility 
that a maximum of between 1000 and 3000 bits per second could be trans- 
ferred at a synaptic link. 

It appears to the writers at least that the question is still mainly of aca- 
demic interest. Its discussion must not be taken to imply a belief that 
either binary coding or pulse-interval coding in the communication 
engineer’s sense are the modes of operation of the central nervous system. 
Much more likely is it that the statistically determined scurry of activity 
therein depends in one way or another on all the information-bearing 
parameters of an impulse—both on its presence or absence as a binary 
digit and on its precise timing and even its amplitude, particularly on the 
effective amplitude as modified by threshold control, proximity effects and 
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the like. If cerebral activity is the stochastic process it appears to be, the 
informationally significant descriptive concepts when once discovered 
seem likely to have as much relation to the parameters defining the 
states of individual neurons, as concepts such as entropy and temperature 
have to the motions of individual gas molecules—and little more. 
Perhaps the most realistic conclusion is a negative one. The thesis that 
the central nervous system “ought” to work on a binary basis rather than 
on a time-modulation basis receives no support from considerations of ef- 
ficiency as far as synaptic circuits of moderate complexity are concerned. 
What we have found is that at least a comparable information capacity is 
potentially available in respect of impulse timing—up to 9 bits per im- 
pulse in the cases we considered—and it seems unlikely that the nervous 
system functions in such a way as to utilize none of this. It is considered 
equally unlikely that its actual mode of operation closely resembles either 
form of communication system, but this quantitative examination may 
perhaps serve to set rough upper limits and to moderate debate. 
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Let two or more individuals each possess different quantities of two objects 
of satisfaction. Under certain conditions they may agree to exchange part of the 
objects if this leads to an increase of each one’s satisfaction. The equations which 
govern this process have been derived by G. E. Evans (1930) for the case of two 
individuals. A different proof of these equations is given here and the equations 
are generalized to the case of more than two individuals. 


The problem which we are going to discuss originated in economics, 
though it is of importance in more general social problems. We are con- 
cerned with two individuals who possess corresponding amounts a, and ay 
of one commodity, and 6; and & of another commodity. These individuals 
may trade the two commodities in such a way that after exchange they 
correspondingly possess the amounts x; and 2 of the first commodity and 
y, and 2 of the second. We then have 


1+ %_ = 1+ 2 ; (1) 
Mmtyw=bt+h. (2) 


The problem is to determine the values %, «2, y1, and yz. Evans (Joc. cit., 
pp. 125-28) has given a solution of the problem based on considerations 
of the utility function. His proof, semi-geometric, is somewhat involved 
and is not readily generalized to several individuals. 

In our previous work (Rashevsky, 1947, 1951) we used the concept of © 
satisfaction function, which at least formally is identical with the utility 
function of the economist. We shall now give a proof of Evans’ solution in 
what we believe is a simpler way, and shall indicate its generalization to 
more than two individuals and more than two commodities; or, as we 
prefer to call them in a more general way, objects of satisfaction. 


chole 
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Denote the satisfaction functions of the two individuals correspondingly 


by 
Sa (¥1; V1) 5 So (%2, Yo). (3) 


Let the two individuals trade the two objects at a constant “‘price”’ #, 
so that if, say, the first one gives away an amount da, of x and receives 
in return an amount dy of y,, then always 


oye Oa (4) 


Since p is constant, therefore this implies 


Vy Oy eee ee 
C= BY X_ — ag 


raid A (S) 


The first individual will be willing to trade as long as by giving away 


an amount da, and receiving an amount dy, = — pdx; his S,(x, y1) is in-* 
creased. The change of S; due to giving away an amount dx; is equal to 
OS; 
Oxy dx}, (6) 


whereas the change of S; due to receipt of dy, is equal, because of equation 
(4), to 


eee thy 
yi ge eae ee (7) 


Hence the first individual will trade as long as dS > 0. Because of dx, < 0 
this implies 


= p< 0g (8) 


His satisfaction function S; will be at a maximum for such values of x 
and y,; for which 


85s _ 8S; 
Ox, Oy. (9) 


By a similar argument we find that the second individual will trade 
until 
OSs OSs 
Croce dee (10) 


A complete agreement will be reached if, and only if, both (9) and (10) 
are satisfied. Putting 


05 et Be he ast aSs aSs 
Tyg to cag = ho iol eee 
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we see that the values x, 2%, 1, and 4, for which neither individual will 
will want to trade any further, satisfy the equations: 
X1(%1, 91) = PV 1 (41, 1) ; (12) 
X2(%2, Yo) = PV2(Xa, yo). (13) 
Together with equations (1), (2), and (5) we have five independent equa- 
tions for the determination of the five variables a, 4%, y1, yo, and p. 


Eliminating » from equations (5), (12), and (13), and making use of 
(1) and (2), we find: 


X 1 (4%, V1) = Vier; 1) 
Xo (4+ de — %4, bi + b2— 91) Vo (a+ a2 — %4, by + be — 41) 


and 


(14) 


Yi— 41 X1 (%1, 1) 


X%1— ay, VY, (%1, 1) é 


(15) 


Equations (1), (2), (14), and (15) determine 21, x, 1, and ye. They are 
identical with those given by Evans. 

This derivation is easier to generalize for the case of three or more 
individuals trading only two objects of satisfaction, than is the derivation 
given by Evans. For three individuals we have: 


X14 %2+%3=a,+4+43, (16) 

vit vot v3 = b:+ b24+ 53 , Gee) 
and 

(DIMOU TEC ETE ae 

To eS Re ae 
Putting 

aS, _ asi 

Epp Sar prea ee uae 


we now find, by the same argument as before: 


Xi (4%, yi) =P V1 (41, 91); Xo(%2, Yo) =pPVo2(%2, v2); 


(20) 
X3 (43, V3) =pY;3 (xs, 3) : 
From equation (18) we have 
Pier e Vive tl yess 5 (21) 


Only two of the equations (21) are independent, because of equations (16) 
and (17). Thus we have seven equations to determine the seven variables 


1, X2, X3, Mi, V2, V3, and p. : ; 
An interesting generalization is suggested by considering / not as con- 
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stant, but as a function of a; and 4, in the case of two individuals. Instead 
of equation (5) we now shall have a more general expression of the form 


P (41, v1) = Const . C220 


If p(%, 91) is prescribed, we proceed as before. We may however 
attempt to determine such a form of p(«1, y1) which, for instance, gives 
a largest possible value to either S;, So, or Si + S_ after the trading is 
completed. 

The author is indebted to Dr. George Karreman for reading the manu- 
script. 
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An analogy is drawn between a hierarchical communication network and a 
pyramidal election system. An equation is derived relating the probability that 
a given person will vote for an issue to the probability that the elected officer 
willalso vote for that issue. The equation contains the size of the population, the 
required majority ratio, and the number of stages in the election system as 
parameters. 

Such an election system is compared to the simple majority vote without a 
hierarchy. The absence of a hierarchy appears to favor the minority voice. 

All of the results are translatable into the language of neural nets. 


Neural nets, groups of cooperating individuals, computing machines, 
and a great variety of other phenomena can be thought of as connected 
systems of elements transmitting certain changes of state. It is not surprising, 
therefore, that the language and nomenclature of these various phenome- 
na are to some extent interchangeable. Furthermore, as we might also ex- 
pect, certain quantitative relations seem common to all of them. Thus, 
some of the mathematical formulations, though couched in the terminol- 
ogy of neural nets, have interesting implications in the theory of epi- 
demics. 

The use of a specific terminology may, if taken too seriously, divert our 
attention from the more general aspects of the mathematical formulations. 
It may, on the other hand, serve as an intuitive guide in developing the 
equations. In this paper the terminology of neural nets will be emphasized, 
but some other interpretations will also be given. 

Communication networks can be divided into two broad classes—those 
which contain closed loops (feed-back systems) and those which do not 
contain closed loops. An interesting example of the second category is the 
hierarchical structure. 

Hierarchical structures have been met in the theory of neural ontogeny 
(Shimbel, 1948, 1950; Rapoport and Solomonoff, 1950). 

Figure 1 is an example of a hierarchical communication net. In particu- 
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lar, we may think of it as a group of interconnected neurons, the heavy 
dots representing the cell bodies and the arrows their axones. The axones 
of the outermost cell bodies are seen to converge in groups of four to the 
cell bodies of “higher order’’ neurons in a circle just inside of the peripheral 
one. This process is repeated until the axones of the innermost circle con- 
verge upon a single ‘‘master neuron”’ located at the center. 

The circular arrangement of neurons (or, more generally, elements) used 
in Figure 1 to illustrate the hierarchical structure is arbitrary. Pyramidal 
and tree-like arrangements would be completely equivalent so long as the 


si \Pa 
eo 


Za Ky 


FIGURE 1 


order of the connections was not disturbed. The important thing here is 
not the geometry, but the hierarchical ‘‘topology”’ of the system. 

Let us suppose that the master neuron receives axones from a different 
neurons (on the innermost circle), and that each of these receives axones 
from a different neurons from the next circle, and so on. Now, if we trace 
any pathway from a peripheral neuron to the master neuron, we will in 
general make contact with K neurons (including the master neuron), 
where K is the number of concentric circles in the hierarchical diagram. 
Each neuron in the system can, therefore, be thought of as a member of a 
“‘stage”’ of the hierarchy. The stages can be numbered from zero (the out- 
ermost circle) to K (the master neuron). 

In general, any neuron in the system will fire if it is sufficiently stimu- 
lated by its subordinates (those neurons which send axones to it). The 
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number of subordinates which must act together (within the period of 
latent addition) in order to comprise a sufficient stimulus may be thought 
of as the threshold a — 8 of the neuron, where 8 is the maximum number of 
subordinates which may fail to fire during a sufficient stimulation. The 
peripheral cell bodies receive no axones and must, therefore, be thought of 
as receptors. 

In particular, we may suppose that the receptors are photo-sensitive - 
and that they have been exposed to a uniform field of light for a brief inter- 
val of time (less than the period of latent addition). This stimulus will 
cause some of the receptors (those which have absorbed enough energy) to 
fire. The number of receptors to become so activated will, of course, de- 
pend upon the intensity of the stimulus. 

If the intensity is sufficiently high, then the probability that a large 
number of the receptors will fire is also high. This would result in the effec- 
tive stimulation of the more central neurons, which after K stages would 
bring the master neuron into activity. 

Conversely, for low stimulus intensities, we should expect that only a 
few of the receptors would be activated, so that correspondingly fewer 
central neurons would come into play, thus greatly reducing the probabil- 
ity that the master neuron will eventually be fired. 

Let us denote with p; the probability that a randomly selected neuron 
of the kth stage is eventually brought into activity as a consequence of 
such an initial stimulation of the peripheral receptors. Similarly, we shall 
denote by px the probability that the master neuron will fire, and by fo 
the probability that any given receptor will fire. Our first problem is to 
determine the relation which must exist between px and po. 

Note that any neuron of the Ath stage will fire if, and only if, a — 8 or 
more of its subordinate neurons have fired. (They must necessarily be in 
phase, because of the brief stimulation period.) This, however, can happen 
in a great many different ways. 

It could happen, for example, that all of the subordinate neurons fire. 
(This could occur in only one way.) It could also happen that all but one of 
the subordinates fire (that is, 8 = 1). In the latter case the failing neuron 
could be any one of the a subordinates. 

Now, in general, 7 of the subordinates can fail to fire, where 7 is any 
integer smaller than or equal to 8. These i failing neurons can be chosen 


among the a subordinates in exactly ( i) different ways. 


It follows from the above considerations that p;41, the probability that 
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a randomly selected neuron of the (& + 1)st stage will fire, is given by a 
sum of mutually exclusive probabilities, namely, 


B 
a a—z a 
per (4) # (1 —,):.. (1) 


It follows from equation (1) that p.4: can be expressed as a polynomial of 
degree a in p;,; in fact, after elementary transformation we obtain: 


m= SSH OQcom] (2) 


Expressions (1) and (2) are iterations on &. They are, therefore, formal 
solutions to the problem. After K iterations we can (at least in principle) 
obtain px as a function of po. 

Neither of these expressions, however, lends itself readily to further 
analysis. An alternative formulation may be obtained by the following 
device. 

Let 7; denote the ratio of the probability of firing in the kth stage to the 
probability of failure. Thus 


1, = je Pr . 
It follows that 

Pr Tre 1 
and 

1 
1— Pr = asad : 
The definition of 7;, together with equation (1), implies that 
1 
Piti= aE oe Gey, 


Now consider the following definitions: 
N Cre, a) = (rel), 
F (rz, a, 8) =the first 8+ 1 terms of NV 3 
R (rx, a, 8) =the last a — B terms of V . 
It follows from the above definitions that V = F + R. This permits us 
to write expression (3) in a more convenient form, namely, 


F (ris Qa, B) 
Piti= oN Cea & p (4) 
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Furthermore, since 
Pet1 
AS eer geer 


1 — pyr’ 
we obtain 7;,; as a function of 7, namely, 


F 
r= (S) 
from which we find: 

F Cr; Qa, B) 


ee (ra: a, B) ; oo 


Before proceeding it is interesting to note that equations (4) and (6) can 
be interpreted to represent a “pyramidal” election system. 

Let us suppose that a population must decide in favor of or against a 
certain issue, and that they choose to render this decision by means of a 
ballot. 

They could of course choose to settle the issue by a simple majority 
vote. This, however, is only one of many possible methods. The population 
could, for example, divide itself into many groups, each group electing an 
official by popular ballot within the group. The officials so elected could 
then decide the issue by popular ballot among themselves, or break up into 
smaller groups, thus carrying the process a step further. The electoral col- 
lege of the United States (at least technically speaking) represents such a 
group of elected “‘electors.’’ This procedure is carried much further in the 
Soviet Union. 

If such an election were to take place and the official ultimately elected 
was given the job of deciding the issue at hand, and if, furthermore, all of 
the intermediate officers were completely honest with their constituents, 
then expressions (4) and (6) could be used to predict the outcome of the 
election. 

In such an election the ratio 


a—p 


would be a measure of the required 


a 
majority. Dishonesty in an elected official would be equivalent in the 
language of neurons to the failure to fire in spite of adequate stimulation. 
Expression (4) can be easily modified to account for the case of “‘failure 
to fire in spite of adequate stimulation.’ Unfortunately, however, the 
complexity of the resulting expression makes it very difficult to use. 
Its form appears below: 


B 


2S 36) (Ge ed Op) 7) 


t=0 7 
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where C is defined as the probability that an adequately stimulated ele- 
ment will nevertheless fail to fire. 

The use of election terminology suggests a simple check on equation (4). 
Suppose that there are two candidates for an office in the (k + 1)st stage, 
and that each of the constituents of stage k are equally disposed to vote 


for either of them. This means that p; = 3, whence 
Pr 
p= 1. 
Vk eae 
Now, for simplicity, let a = 28, that is, an election ratio 
lie 
r empulsy 


is required for election. (Actually both candidates can win.) It seems in- 
tuitively evident that under these circumstances the probability 41 (that 
a given candidate will be elected) should be exactly one-half. 

From the definition of N(r:, a), we have N = (1 + 1)? for r, = 
Also, F(1, 2, 8) is simply.the first half of N. Therefore, pi4. = F/N = 
as we expected. 

In the foregoing example we found that if p, = 3, then p;4, is also one- 
half, regardless of the size of the population. We should expect, however, 
that if the population is biased (that is, if p; is greater or less than 3), 
px+i will be more strongly biased in the same direction as p;. The exaggera- 
tion of the bias in p, expressed by #4, should depend upon the size of the 
population. (For the trivial case of a = 1, we will always have ps1 = px 
regardless of bias.) 

To see how the size of the population affects an exaggeration in the bias 
of px we can look at equation (6). Let us suppose that p, > } and, for 
simplicity, let a = 28. This implies that 7, > 1. By definition 


fe ee 


From the symmetry of the binomial coefficients we can rewrite this ex- 
pression to read 


RIE Re 
4 


> 


But also 
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It follows from the foregoing that for every term (*) r, ‘in the nu- 

t 
merator of 71,4: there is a corresponding term in the denominator, namely, 
is) r,. Now since 7; > 1, the powers of 7; will be even larger. But since 


a —2> 1, the “exaggeration’’ of 7, will be greater in the terms of the 
numerator than those of the denominator. This effect will be increased as 
a is increased. Thus the larger the population, the larger will be the exag- 
geration of px. 

In the above example we assumed that 8 = 3a. This, of course, need 
not be true in general. An increase in 8 would be the same as lowering the 


ip 


threshold a — 8, or the election ratio 


If we let are represent the ratio of the change in ;4; as B goes from 
6B — 1 to B, then we have 


vere) o cS (8) 


AB R 


Note that for 7; > 1, that is, p, ~ 1, the denominator of expression (8) is 
very large compared to the numerator, so that 


APrti 
AB 


approaches zero. Similarly, as 7; — 0, that is, as p, — 0, the denominator 
approaches unity and the numerator approaches zero. The ratio 


appears then to be most sensitive in the region of r, ~ 1 or py ~ 5. 

In order to examine more closely the behavior of p;4: as a function of 
px, a, and 8, we can write expression (1) as the ratio of two integrals, 
namely, 


chi “kya-B-1 (1 — x) dx 
0 


(9) 
sf axe-b-1 (1 — x) 8dx 


Prti= 


The identity of expressions (1) and (9) can be readily proved by differ- 
entiating both expressions with respect to p, and comparing the deriva- 
tives. 
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Note that the denominator (D-!) of expression (9) is the beta-function 
of a — Band B + 1, that is to say, 
D1=B(a—6, B--1), (10) 
where . 
=: ACG) = Hp a—ldx : 
B(m, n) i xml (1 — 2) 


In general, however, 


[ (m)T (n) 
Vi aes 11 
B(m, n) Tite Clay 
where I’ represents the well-known gamma-function. It follows then that 
ae Ease) eae 12 
D1= Takt (12) 
However, for integral , '(~) = (m — 1)!; hence 
pe ees (13) 
a. 
But since 
(=~ 
67 Ma eels 
we have 
--0-9-(5) 
so that 


Prti= (a—p)(%) fer a) ede. (14) 


In order to examine a particular case, let us suppose that a = 28 and that 
6 is sufficiently large to permit an approximation of 8! by Stirling’s for- 
mula. For these conditions we have 


p= Ee 


pon = Ve ap [1 — 2) ede, (15) 


hence 


By direct integration of expression (15) we obtain 


z ne B 
pen=Ve sae ee G) saa 


Note that for p, sufficiently close to zero the first term of the poly- 
nomial in expression (16) becomes dominant so that, roughly, 


(16) 


busi ag hm Agh. (laa 
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Expression (17) is a good approximation only if Bp; is small compared 
to one. A few values are listed in Table I with 6 taken equal to ten. 


TABLE I 


Dk 1/1000 1/100 1/90 1/80 | 1/70 


| 


Lye | 2S XK MP 28 SS OO) OG SOME DSSS OREN ID Se Te! 


If p, is sufficiently small to permit the use of expression (17) then we 
note that p:4: is very much smaller, so that we would be more justified to 
use equation (17) for successive iterations; hence for /o sufficiently small 


we have 
Pe AP, 
p, = Abt ps 


— Ab*—l+p1-2+ ... +1 pi 
p, = Ae +8 pie. 


Now, if K is the number of stages in the hierarchy, then since 


oe 
Le eee ee 
[ene | 
we have 
p= AG 18-1) pF (18) 
For large 8 we have 
i eas ML 
(elimeag ch ae oe 
so that expression (18) is approximated by 
ita tis) Pelee eS 3 19 
Pg = AGTH = sal pre. (19) 


On the other hand, if instead of a pyramidal ballot a direct majority 
vote were taken, the probability px corresponding to px would have a very 
different form. In fact 


2 TG K fyi — 
p= ne (28) 2%} /[ (28% /2] So RICOEM ony 
A Bs ( : ) 
It is rather difficult to compare expressions (19) and (20) for arbitrary 
Band K. 
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On the basis of a few examples, one of which is given below, it would 
seem that, in general, px < px. 
For the cases a = 4, 8 = 2, and K = 2 we have 


_ 6528 . 4069 
be@Geg1 1 fe ee eas de 


This and other special cases indicate that the hierarchy strongly sup- 
presses the minority voice. 

This analysis is, of course, not nearly general enough to constitute an 
adequate criticism of actual election systems. Many other factors, which 
have not been considered here, enter into the real situation. 

It would be desirable to say something about ;41 as a function of p; for 
the region p; ~ 3. 

Let us assume that the function p;, = p(k) can be approximated by a 
smooth curve continuous in & and having a continuous first derivative 
with respect to &. In such a case we could differentiate expression (14) and 
obtain 


dp (k) 


Ok 


wee 


= (a8) (5 ib (AC = p(B) yan 


k=i+1 


Equation (21) is a nonlinear differential difference equation, so that 
direct solution is difficult. It does, however, tell us something about the 


behavior of p(k). . 
To begin with, notice that for the trivial cases of p(k) ~ Oand p(k) ~ 1 
Walia! 
Ok 


is small, as we would expect. We know from our discussion of ‘“‘bias”’ that 
for p(k) = 4 
dp (Rk) 


tha: 


This condition is consistent with equation (21). We also know that if 


p(k) differs slightly from 4 (and a = 28) that this bias will be exaggerated. 
In fact, roughly, 


dp (R) _4{8 wets ate 
iat) os 
hence | 
Op (k) B of ee 
dk We, e E ee (22) 
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Equation (22) tells us that the slope of the curve p, = p(k) rises very 
rapidly with increasing &, the rate of this rise being proportional to the 
square root of B. 

Further development of this theory should include the possibility that 
elected officers will not always follow the dictates of their constituents. 

This investigation is part of the work done under Contract No. AF 
19(122)-161 between the U.S. Air Force Cambridge Research Laboratories 
and the University of Chicago. 
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The problem of finding the ‘weak connectivity’’ of a random net is reduced 
to one involving a Markov process. This provides a mathematically exact treat- 
ment of the problem which had previously been treated by an approximation, 
whose justification was not rigorous. The exact method allows in principle not 
only the calculation of the “weak connectivity,” but also of the “strong connectiv- 
ity,” and, in general, the probability that from a randomly selected neuron in 
the net there exist paths to a specified number of neurons. The computations 
become exceedingly involved for large nets. 


A previous paper (Solomonoff and Rapoport, 1951) dealt with an ap- 
proximate method for determining the ‘“‘weak connectivity”’ of random 
nets. The reader is referred to that paper for the definitions of terms and 
the statement of the problem. 

In this paper we will indicate a mathematically exact method for 
calculating the probability that from an arbitrarily selected neuron in 
a random net paths exist to any specified number of neurons. The ex- 
pected number of such neurons is then the ‘‘weak connectivity.”” On the 
other hand, if the specified number is the largest possible, the associated 
probability is the ‘‘strong connectivity.” 

We follow the “tracing procedure’’ described in the previous paper. 
Let x(¢) represent the actual number of neurons contacted in all by the 
ith tracing inclusive, and let y(¢) be the number of neurons newly con- 
tacted by the éth tracing. Then 


x(t) = >? y(i). (1) 
1=0 


Let p(z, t) be the probability that there were « neurons in all contacted 
by the éth tracing. Then (a, ¢) depends not only upon the possible values 
of x on the (¢ — 1)th tracing, but also upon those values on the (¢ — 2)th 
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tracing, since only the newly contacted neurons of the (¢ — 1)th tracing 
are traced on the ¢th. If a(t — 1) = i and x(¢ — 2) = 7, then 


P(x, i) = 72-11 e— g)5 4). (2) 


Here 7;,(s, m) is the probability that & neurons will be newly contacted, 
when there are s axones being traced and there have been m neurons al- 
ready contacted. The equation 


ro(s,m)=(" 5”) oben t k= i" (3) 
7=0 


was derived by A. Rapoport (1951). This can be more compactly written 
in the notation of finite differences as 


TECS; Pacey ao (4) 


where A*(m*) is given by the following iteration formula: 
Amn =n ya 


A* (mm!) = AF] (m1) *) = APT ay) . 
Consider now an abstract system with NV? + 1 possible states. We can 
designate each state by an ordered pair (7, 7), where 7 and 7 range inde- 
pendently from 1 to NV, and an additional (initial) pair (0, 1). Furthermore, 
suppose that if the system is in state (7, 7) at time #, then the probability 
that it will be in state (, /) at time ¢ + 1 is independent of ¢ and equals 
pl(i, 7), (R, 1]. Hence if we designate by D[(z, 7), ¢] the probability density 
distribution over all the possible states at time #, this distribution at time 
¢ + 1 will be given by 


Di(k,l),t+11= > pl@ 7), (ke, DIDIG, 7), #1. (6) 
(@) 7) 

This equation is obtained by summing over all possible ways in which the 
system can make transitions from states (7, 7) to a state (k, 1). The p’s are, 
of course, the transition probabilities. The D’s are probability density dis- 
tributions of the V” + 1 possible states. We shall refer to the D’s simply 

as the distribution vectors. 
It is now clear that we are dealing with a process which can be described 
in terms of a vector-matrix equation, in fact, a Markov process. The 
vectors have N? + 1 components and the matrices V4 + 2N? + 1 com- 
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ponents. If we designate our transition matrix by P, equation (6) can be 
immediately generalized to 


Dili") =P*D (i). ait) 


This follows by a simple induction on n. 

In terms of our problem we then have the following interpretation. 
Let i(¢) be the total number of neurons contacted by the (¢ — 1)th tracing 
and 7(¢) the total number contacted by the /th tracing. Then p[(, /), 
(k, 1)] is the probability of the following combination of events: 


1. At ¢ — 1, there were 7 neurons contacted in all. 
2. At t, there were 7 neurons contacted in all. 

3. At #, there were & neurons contacted in all. 

4. Att + 1, there will be / neurons contacted in all. 


Evidently 2. and 3. are either identical or incompatible. They were 
stated separately only to give meaning to the quadruple index (i, 7, k, /) 
in terms of which the vector-matrix formulation was obtained. In view of 
the meaning of our indices, we see that p[(z, 7), (k, 2)] = 0 unless: <7 = 
k <1, so that our transition matrix P has non-zero elements at most at 
those loci [(z, 7), (7, /)], wherei $7 S 1. 

Now let an initial condition be known, e.g., at / = 1 exactly one neuron 
is contacted. Then the initial distribution vector-is (1, 0, 0...) with the 
unity representing the certainty of the initial state (0, 1) at ¢ = 1. The 
elements of the transition matrix are given by 


as oe 5 Won Ass) eS) 
pl, (i DI=Q_?) ee (8) 


This is essentially the expression (4) with 


k =1— 7, the number of newly-contacted neurons at ¢+ 1 ; 
m = j , the number of neurons in all contacted at ¢ ; 


s = a(j-—i), the number of axones to be traced at ¢ . 


To find the final distribution vector at ¢ = + 1, we multiply the 
initial distribution vector (1, 0...) by the mth power of the transition 
matrix. Note that if on any tracing no new neurons are contacted, there 
will be no axones to trace on any subsequent tracing, and, therefore, the 
total number of neurons contacted will remain stationary. But there are 
exactly NV — 1 uncontacted neurons at ¢ = 1. Hence there can be at most 
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N — 1 tracings, and we can set » = N — 1. The distribution vectors will 
remain unchanged for ¢ > WV. 

Another way of looking at it is by noting that for ¢ = N, the distribu- 
tion vector is the sum of eigenvectors, each of eigenvalue unity. All the 
components of these eigenvectors vanish, except one of the (7, 2) com- 
ponents. There are exactly N such eigenvectors, one for each 7. 

It appears, therefore, that the final distribution vector D [@, 7), N] 
will have non-zero components only where i = j. Each of these compo- 
nents with index (i, 7) will represent the probability that there exist paths 
from an arbitrary neuron to exactly 7 other neurons. In particular, the 
component (NV, NV) is the “‘strong connectivity” of the net, which, as will 
be recalled, was defined in the previous paper as the probability that 
paths exist from an arbitrary neuron to ai] the other neurons of the net. 

To find the weak connectivity y, we take the expected value of x/N 
at ¢ = N, so that 


¥=5 DID), NI. (9) 


In the following numerical example V = 4, a = 2. Our transition ma- 
trix, computed from (4), is shown in Table I. 


TABLE I 


ORT Le elie 1.3 Dad (21s 2)2) 2.3) 2 4 eee 1 Sit SS ee Sie ee elm em ee es 
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The initial distribution vector of 17 components is CFOs Oo oe Nee ts 
t = 4 the (final) distribution vector is 


D4) (0720625, 0.0.1 407 ee Oe oy 2 (10) 


CONNECTIVITY OF RANDOM NETS 57 


The non-zero components of this vector give the probabilities that there 
exist paths from an arbitrary neuron to 1, 2, 3, and 4 neurons respectively. 
In particular, the last value, .481, is the strong connectivity of the random 
net with total population 4 and axone density 2. 

The weak connectivity is 


+(1X%.062542X.1407+3X.3164+4X.481) =.804. (11) 


The value of vy given by the approximate equation (cf. Solomonoff and 
Rapoport, loc. cit.), 
y=1-—e-%, (12) 


turns out to be 0.8, a good approximation for this case. 
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The propagation of a single act in a large population is supposed to depend on 
some external circumstance and on an “imitation component,’’ where encounters 
with individuals who are performing or have already performed the act con- 
tribute to the tendency of an individual to perform it. The “tendency”’ to per- 
form is supposed to be measured by the average frequency of stimuli, randomly 
distributed in time, impinging on the individual. The deduced equation is a 
relation between the fraction of the population who have performed the act and 
time, provided the time course of the “external circumstance” and the way in 
which the imitation component contributes are known. Several special cases are 
studied, in particular, cases without the imitation component, cases with imita- 
tion only, and various mixed cases. Examples are given of social situations in 
which such factors may operate and general suggestions are made for the sys- 
tematization of observations and/or experiments to test the assumptions of the 
theory. 


By a single act, we shall understand one which is performed at most 
once in the lifetime of an individual or, more generally, at most once 
during the interval of time under consideration. Examples are suicide; 
marriage; conversion to a faith; or buying an item of lasting use, such as 
a car, a house, etc. We are going to consider the propagation of single acts 
throughout a large but finite population. 

We shall suppose that two major influences contribute to the probabil- 
ity that a randomly selected individual will perform the act at a randomly 
selected instant of time, namely, 1) the pressure of external circumstances, 
and 2) the social imitation component. N. Rashevsky (1951) and H. G. 
Landau (1950) postulate for each individual a characteristic inherent 
tendency to perform an act. In their treatments the act is not generally a 
single act, inasmuch as it can be performed any number of times. The in- 
herent tendency, moreover, is characteristic of the individual rather than 
of the external circumstances. Rashevsky and Landau suppose that to 
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this tendency a social imitation component is added, which depends on the 
number of individuals performing the act. Thus in their treatments the 
inherent tendency is usually independent of time, and the social imitation 
component only implicitly a function of time. In our treatment, however, 
the external influence, being independent of the individuals, will, in gen- 
eral, be an explicit function of time (a constant in special cases), while the 
imitation component will depend on the number of individuals who are 
performing or have performed the act (exactly once). The individuals who 
have performed the act will be called conforming individuals. 

To take a concrete example, consider the motorists driving along a well- 
traveled highway at sunset. The tendency to turn on one’s headlights pri- 
marily depends on the degree of darkness, which, in this case, is a mono- 
tone increasing function of time. There is, besides, another component 
contributing to this tendency, namely, the frequency with which one 
meets cars with their headlights already turned on. This is the social 
imitation component. Another example is a panic, revolution, or riot. The 
external circumstance may-be a progressively unbearable environment. 
The social imitation component may exert its influence through the num- 
ber of individuals already exhibiting the behavior in question. In special 
cases, the external circumstance may be an instantaneous stimulus, such 
as the cry “‘Fire!”’ in a theater, or a single shot precipitating a stampede 
of cattle. 

The theoretical importance of the single act in mass behavior may well 
lie in the ease with which the deduced equations may be tested. Statistics 
on suicides, marriages, joining unions, etc., are readily available. In special 
cases, such as our traffic example, data may be obtained under a variety 
of rather well-defined conditions and even experimental controls can be 
introduced. 

It is intuitively almost evident that the ways in which the various single 
acts will be propagated will show significant similarities and differences. 
From the standpoint of theoretical social psychology, it would be interest- 
ing to determine whether these similarities and differences correspond to 
those predicted by the theory. For example, the social imitation compo- 
nent imposes a certain modification on the imitation-free curve. It would 
be interesting to compare actual curves taken, respectively, from situa- 
tions where imitation is or is not expected to occur, and thus test the 
theory. For example, the increase of turned-on lights in houses at the 
onset of darkness is probably independent of imitation—one does not 
look to see whether the neighbors have their lights on. It would be inter- 
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esting to see if the “‘turned-on headlights’ ” curve (for which the operation 
of the imitation component is assumed) departs as predicted from the 
“turned-on house lights’ ” curve. The death curve of an age group should 
also be imitation-free, while the suicide curve may possibly show imita- 
tion effects. If one finds an imitation effect in the traffic case, one could 
inquire whether such an effect is reduced or disappears entirely on a one- 
way road, etc., etc. 

Another basis of verification may be found in comparing situations 
where the imitation depends on the number of individuals who have al- 
ready performed the act (the number of conforming individuals) with 
those situations where imitation depends on the number of individuals 
actually performing the act (rate of conformance). Our traffic example 
illustrates the former, while suicides (or runs on banks) may illustrate the 
latter. There may be mixed cases, such as marriage, where the tendency 
to marry may depend both on the number of one’s married friends and on 
the frequency with which one hears of people getting married. 

Our approach to the problem will be similar to that recently developed 
in the probabilistic theory of neural nets (Rapoport, 1950). 

As in that theory, the intensity of a gross stimulus will be described in 
terms of an average frequency of randomly distributed instantaneous 
events. The “‘most random”’ distribution of events along a time axis is the 
Poisson distribution, in which the events are entirely independent, i.e., the 
fact that a certain point on the time axis is occupied by an event has no 
bearing on the probability density distribution of the time of the next 
event. 

We shall assume that the events associated with the tendency to per- 
form an act are Poisson-distributed. A change in the intensity of the 
tendency means a change in the average frequency of the events, while the 
random character of the events is preserved. 

Let this frequency be denoted by « = x(t). Suppose that the threshold 
of activity is one, i.e., the impingement of a single event results in the 
performance of the act. We seek the probability distribution of the times 
when the act will be performed. That is to say, in a large population, all 
subjected to a firing frequency «(#), we wish to determine the fraction of 
the population which will be performing the act in a given short interval 
of time (t, t + dt). 

For the act to be performed in this interval, it is necessary and suf- 
ficient that the act has not been performed at any instant prior to ¢ and 
performed at f. Divide the interval (0, #) into » small sub-intervals, each 
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of length At, denoted by their end points h, h,... tn = 1. lf, the stimuli 
are Poisson-distributed, the occurrence of a stimulus in any sub-interval 
does not influence the probability of its occurrence in any other sub- 
interval. Consequently, the probabilities of non-occurrence are also inde- 
pendent. Hence we may take the product of the probabilities, 


{TT u-sw@aitsMa=Poar, (1) 


as the probability that no stimulus has occurred at any instant prior to ¢ 
and that a stimulus has occurred at ¢. Dividing both sides of (1) by dt, 
and taking logarithms of both sides, we have 


log P(t) = log x (t) + > log [1 —« (t;) At}. (2) 
i=1 
Taking A? sufficiently small to justify the approximation 
log [-1 — x (t;) At] ~ — & (2;) Af G33) 
and the passage from finite sum to integral, we obtain 
P (t) ‘ 
“= — + 
log [Say |= fo —# (rar, (4) 
whence 
P(t) =x (1) eSemerr. (5) 


The fraction of the population who have conformed at time ¢ will then be 


t t — f°xGr")de’ ’ 6 
f= ff Plr)dr= f x(re dr (6) 
Therefore 


Fasl ect, Mee (7) 


Regarding P(t) as f’(é), we see that equation (6) can be written as a 
differential equation 


df= (1—f)«(t) dt, (8) 


which is simply a reflection of the fact that the probability that an indi- 
vidual picked at random at time ¢ is performing the act is the product of 
the probabilities that 1) he has not yet performed the act, and 2) that the 
stimulus for performing the act is at that instant impinging on him. Dif- 
ferential equation (8) with the initial condition f(0) = 0 leads, of course, 
to the solution (7). 
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For the special case where « is independent of time, the propagation of 
the act is given by 


gel a Og (9) 


which is identical with the equations of radioactive decay, monomolecular 
reactions, etc. Generalizations can now be made in several directions, e.g., 
by introducing a threshold greater than unity, by making x variable with 
time, by introducing the imitation component, and by combinations of 
these. We shall first consider the introduction of a nontrivial threshold, 
while x is independent of time. 

Case 1: x = constant. The introduction of a nontrivial threshold means 
that the stimuli, whose frequency measures the tendency to perform the 
act, must sum in order to effect an actual performance. Mathematically 
the situation is equivalent to that in which a single neuron with a period 
of latent summation o and a threshold / responds to randomly distributed 
stimuli, when / or more of them happen to fire within c. This case, includ- 
ing a postulated refractory period, was treated in an earlier paper (Rapo- 
port, 1951). In our present case, the individual takes the place of the 
neuron. The “‘stimuli’’ he receives which predispose him to perform the 
act may be very complicated neural processes, but for the purposes of this 
paper, they will be treated as Poisson-distributed instantaneous events. 
The meaning of the threshold / and of the period of latent summation 
is then quite analogous to that pertaining to the single neuron. Refractori- 
ness, however, is meaningless, inasmuch as only one response is considered. 

Since in the special case under consideration « is a constant, equation 
(15) of the above-mentioned paper holds, namely, 


g(*%,o) =1—E,-1(xo) e™, (10) 


where g(x, c) is the probability of finding at least 4 — 1 stimuli impinging 
within o units of time prior to an arbitrarily selected moment and £;(z) 
is defined as 


k 
E,(2) = =. (11) 


Our equation (8) for general /, but constant x, then becomes 
df = (1—f)[1—By-2(%o) e*7] di, (12) 


whose solution is 


—log(1—f) =[1 —E,-2(*o).e~*] ¢-+ constant . (13) 
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Therefore, if f(0) = 0, we have 
f = 1 hind el Ea—s'ere eel t . (14) 


Equation (14) is of the same general form as (7), except that the time con- 
stant x is replaced by the expression 1 — E,_2(xa)e~*", which is likewise 
constant in time. Since the fitting of an empirical curve of the type (8) al- 
lows only the gross determination of the time constant, and not the 
quantities « and o separately, there is no way to distinguish this case from 
the one where / = 1. 

Case 2: x is a function of time. If h = 1, f is given by equation (7). If, 
however, k > 1, g(x, «) now becomes g[x(#), o]. Because of the total 
randomness of the stimuli, however, equation (6) of the earlier paper still 
holds and now reads: 


P(t) dt=[1— f P(r) dr] [ede] gle, ol. (15) 
0 


[Par 


by f(é), we have the differential equation 
df= (1—f)[*@1] glx), o] dt, (16) 


Replacing 


whose forma! solution is 


f= 1—ernJS, eee, olde (17) 


Note that we may suppose ¢ to be very small compared with 1/x, and, 
therefore, that x « < 1 (the quantity xo is dimensionless). This is a conse- 
quence of the fact that o is of the order of milliseconds, if it is an actual 
neurological period of latent summation; while 1/x, being the average 
interval between rather complex events, may be of the order of seconds. 
Hence we may consider x constant for the interval of time comparable to 
g. In this case g[x(#), c] in the neighborhood of the instant ¢ becomes the 
probability of finding # — 1 or more stimuli within ¢ prior to f. And this 
is given for a Poisson distribution as 


— ea (te, (18) 


q=h-1 


But this is precisely the expression on the right side of (10). Equation (17) 
now becomes 


f pe oJ, tr)ler(°—E, lel), a}}e—2 (7) edz : ( 19) 
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Here, since «(¢) is not a constant, ¢ enters nonlinearly in the exponential of 
(19). Furthermore, for a given x(t), the way ¢ enters into (19) depends on 
h. Suppose, for example, x = at. Let us examine the behavior of the right 
side of (19) for small values of ¢, i.e., where a series in [x(#)c] is dominated 
by the lowest power. Then 


er(t)e — F,_»[x (¢) a] 
may be replaced by the lowest term of its series representation, namely, 


gh-1gh-1 qh—lph-1gh-1 


IS (1) 
Then the integral within the brace of (19) becomes 
t qhrhgh-l ae eS h t (ara)*e—-277 
=Shh ee l 
0 (k—1)! a Jy h! ; (21) 
= = il 1 — EF, (atc) ema a ‘ 
ao 


Substituting this expression into (19), we see that for small values of ¢, 
approximately, 


f= co e— (h/ac*) [1— EB, (ate) 917] . (22) 


Again approximating 


[i — tata) e-°'") 6 *” 


by 


(ato) *+1 

h+1! ’ 
expanding the exponential, and neglecting all but the lowest power of 
t, we have the approximation 


(atc) A+1 rs argh) 
Ue ae — pe (23) 


Remark. This approximation holds only for h > 1. If h = 1, a has no 
meaning, and the derivation of the function of g does not follow. We have, 


in fact, from equation (7), where # = 1, 


ee (24) 


for small values of t, whereas equation (23) gives for h = 1, f ~ af’. We 
summarize these results in the following 
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Turorem. Let the frequency of impinging external stimuli be x(t) =a 
the threshold h, and the period of latent summation o. Then the fraction of 
individuals who have performed the act at time t is given by 


pee eo JS, alt —Ey,—alare)e— 277 dr . (25) 


which for small values of ¢ is approximated by 


argh —lghtl 


peo ulee , (26) 


The imitation component. Now let each encounter with a conforming 
individual be a stimulus. Suppose first that # = 1 for all stimuli, so that 
either an external stimulus or the above-mentioned encounter is sufficient 
to induce the act. If the frequency of encounters is proportional simply to 
the density of individuals who have performed the act (as is the case in 
out traffic example), we have ) 


df= (h— let) + bf) dt - (27) 


If x is a constant, equation (27) can be solved by separation of variables, 


namely, 
df 


G=N@Pof 
which leads in a straightforward way to the solution 


> x [e@te)t— 1] 


dt (28) 


’ 


Equation (29) is seen to reduce to (9) if b = 0, as, of course, should be the 
case. If « is a function of ¢, the differential equation (29), being nonlinear, 
is not readily solvable. Assuming f to be a power series in ¢, however, 
we can calculate the coefficients of that power series and compare them 
with those where imitation does not occur. Therefore, let 


f= >) ee (30) 
i=1 
and « = at, as before. Then 
df wal. 
or DS oui (31) 
i=1 


and we have 


PEL [1 Ss est |((a+ bcr)t+ bo+. a ] ; (32) 


4=1 


MASS BEHAVIOR 167 
Equating the coefficients of like powers, we obtain 


_ ba 


a=0, =F, Cpe Bie oe 070 


i aa oaeaey mean 


The dimension of a is [T]~; of 6, [T]“; and of c;, [T]#. Therefore 


av? abt? § (ab? — 3a?) t4 
2 6 24 


= (34) 
The curve starts out as a//2 as in the imitation-free case, and is perturbed 
by a positive term abé?/6, which is absent in the imitation-free case. The 
fourth power term in the imitation-free case is —af*/8. The perturbation 
in the coefficient of the fourth power is, therefore, +-ab?/24, etc. 

If kh > 1, we can assign a “‘stimulus strength” to encounters with con- 
forming individuals, i.e., suppose that such encounters are equivalent to 
external stimuli. We shall outline the case for constant «. We must 
replace x«(¢) by our function 


glx (t),o, h] = 1—E,-al« (4) o] e 
and, evidently, if k < h, bf must be replaced by g[x(t), o, h — klbf, since 
an encounter is equivalent to reducing the threshold by & units. If k = A, 


bf remains in the equation with coefficient 1, since an encounter is alone 
sufficient to produce a response. Equation (27) now becomes 


df=(1-f)igle(@),o,h4] +glx(t),o,h—kl]bfjdi. (35) 


We may also mention the case of ‘‘pure imitation,” where the external 
stimulus is negligible. Here «(¢) = 0, and 


ay re 
apa? ies (36) 


Equation (36) is the well-known “‘ogistic’’ equation, whose solution is 


K e* 
STE Ke ae 

where m 
ee OS 38 
Raga: (38) 


Obviously, if f(0) = 0, the propagation of the act will never get started. 
If, however, a “leadership” initiates the act, the propagation gets on the 
way through imitation alone, and eventually will involve every individual. 
Such a model may well represent the propagation of an act after an in- 
stantaneous stimulus, e.g., in a stampede or a panic. 
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One more class of problems will be mentioned, namely, those where the 
imitation factor depends on the rate of increase of conforming individuals 
instead of on their actual number. In actual situations, the dependence 
may well be on both these quantities. In the simplest case, we have by a 
proper modification of equation (27) 


ofan df 
Fo a-p(«ts9 (39) 


If x is constant, the variables are separable, and we obtain 
it 
df [z-? Jee, (40) 
which leads to the solution (assuming the usual initial condition) 


—log(1—f) —bf=at. (41) 


Being of mixed transcendental-algebraic type, equation (41) is not readily 
solvable for f, but it is for ¢, namely, 


t= —=[log(1—) + 5/1. (42) 
Since log (1 — f) < 0, and df = 0, equation (42) has meaning only if 
sel HO a) tec: 
b<— 7 Sel has oleae (43) 


for all values of f(0 < f < 1). This is so if, and only if, 
os1. (44) 


Remark. The meaning of this inequality is a consequence of our tacit 
assumption that imitation is propagated instantly. If b were unity under 
these conditions, the rate of conforming would be infinite. 

Expressed as a time series in f, ¢ can be written as 


r= [saa 42 -+e+...]. (45) 


On the other hand, for the case where imitation depends on the number of 
conforming individuals, we have 


Ae aoe: ese ‘= 
I= [log (1 +5 f)—tog (1-7) J, (47) 
5 


5 (G+) -Gan-a) e+(s543) &--] 4 


a 
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Comparing the series (48) and (45), and each with the imitation-free 
ease, we can get the perturbation of the coefficients of any number of 
terms desired. 

This investigation is part of the work done under Contract No. AF 
19(122)-161 between the U.S. Air Force Cambridge Research Laboratories 
and the University of Chicago. 
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Systems of two, three, and four linear non-homogeneous differential equations 
are examined with a view toward determining whether they can possibly serve as 
mathematical models to describe periodicities in the concentrations of substances 
which enhance or inhibit each other’s rate of production (or dissipation). The 
nature of the model demands that the solutions of the differential equations be 
non-negative at all times, i.e., that all the steady states be positive. Conditions 
for periodicity and for positive steady states are derived, and it is shown that 
these conditions are not always compatible with each other. In particular it is 
shown that certain three- and four-hormone models proposed to account for 
the periodicities observed in the menstrua] cycle cannot satisfy the above condi- 
tions for any values of the parameters and hence are inadequate. 


There have been attempts to describe the periodic fluctuations of the 
concentrations of hormones in the blood stream associated with the men- 
strual cycle in terms of an open system of first-order linear differential 
equations, namely, 


ee he Cas re eet) 
where the x’s are the concentrations of the several hormones. We call such 
a system “open,”’ because the constants ajo, if not all zero, indicate that at 
least some of the concentrations x; may be produced even in the absence 
of any of the substances. Hence the system exchanges matter with its 
environment and is in thermodynamic parlance an open system. More- 
over, the substrates of the substances are all outside the system and are 
supposed to be inexhaustible. This, too, characterizes the system as 
“open.” 

In particular, H. Lamport (1941) examined one such system as a pos- 
sible mathematical model for the ‘push-pull’ theory of hormone inter- 
action, where the follicle-stimulating hormone of the pituitary gland is 
supposed to enhance the production of estrogen, while estrogen, on the 
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contrary, is supposed to inhibit the production (or enhance the dissipa- 
tion) of the follicle-stimulating hormone. Denoting the hormones by F and 
E respectively, Lamport examined the system 


dF 
di = A19 — Q4,F == Qy2H > 
(2) 
dE 
Ane Ay — Aookt 7 


where the a’s are all positive constants and the signs on the right side indi- 
cate the enhancing and inhibitory effects attributed to the hormones with 
the additional assumptions that each hormone dissipates at a rate propor- 
tional to its own concentration and that the pituitary is able to produce its 
hormone independently at the rate a1o, but that the ovary is unable to 
produce estrogen independently. 

It must be noted that there are at least two ways in which one hormone 
may “inhibit” the rate of concentration of another. It may 1. simply en- 
hance its dissipation (or breakdown) from the blood stream, or 2. it may 
act inhibitively on the cells which secrete the other hormone, that is, in- 
hibit its rate of release into the blood stream. As long as the rate of active 
production (or secretion) is positive, the two types of action are formally 
(mathematically) equivalent. However, the inhibitive action of the second 
kind has no meaning once the rate of secretion has vanished. When this 
occurs, the intensity of inhibitive action, represented by the coefficient of 
the inhibiting substance on the right side of equation (1) or (2), must be 
set equal to zero. This abrupt change in the constants does not appear in 
equations (1) or (2). We must therefore assume that such equations pur- 
port to describe inhibitive action of the first kind, where in Lamport’s 
equations the ovarian hormone is supposed to enhance the dissipation of 
the pituitary hormone from the bloodstream rather than inhibit its produc- 
tion or secretion by the gland. This interpretation is not in accord with the 
generally assumed action of ovarian hormones (cf. e.g., Grollman, 1947), 
We shall, however, consider equations of the type described on their own 
merits. 

Lamport was interested in the conditions under which a system such as 
(2) would give rise to periodic solutions and thus would exhibit the fluctua- 
tions in the concentrations of the hormones during the menstrual cycle. A 
necessary condition for periodicity of the system (2) is that the equation 

eed Perr) G2 


ae (3) 


91 Ao —— 
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has a pair of pure imaginary roots. But this implies that ay + ay = 0, 
contrary to the assumption that both are positive. This contradiction, to- 
gether with some numerical considerations regarding relations between 
the magnitude of the constants and the length of the menstrual cycle, led 
Lamport to conclude that a system of two linear differential equations 
cannot describe the periodicity of hormone concentrations in the men- 
strual cycle. He further concludes that this mathematical analysis makes 
the “push-pull” theory untenable. The latter conclusion, however, does 
not seem warranted. Equations (2) are not the only possible mathematical 
description of the ‘‘push-pull” theory. We have already pointed out that 
those equations do not themselves even describe the theory properly (be- 
cause of the interpretation of “inhibition” inherent in them). Moreover, 
other types of equations could give a more adequate picture, for example, 
nonlinear equations or equations with more variables. It is proper, how- 
ever, to examine a particular class of equations with a view of determining 
their possible application to phenomena such as the menstrual cycle. In 
this paper we undertake to examine linear open systems involving two, 
three, and four substances. 

In 1948 R. Kesselman (unpublished) proposed three- and four-hormone 
models. An examination of the models revealed that the conditions for 
periodicity were incompatible with those for positive steady states. Kessel- 
man’s models were thus shown to be also inadequate. 

These investigations led to the general question: Is it at all possible to 
describe the menstrual cycle periodicities of hormone concentrations in 
terms of a linear differential equation model under the assumptions usu- 
ally made concerning the interaction of the hormones? 

One would strongly suspect that linear models are, in general, inade- 
quate. To begin with, conditions of periodicity imposed on such models 
imply rather stringent restrictions on the constants. We cannot expect 
these conditions to be exactly satisfied in nature. If they are only approxi- 
mately satisfied, modifications of amplitude are introduced into the oscil- 
lations of concentrations either in the form of damping or reinforcement. 
Such effects have not been observed. On the contrary, the onset of men- 
struation at puberty and its termination at menopause are either abrupt 
or associated with irregularities not accounted for by damped oscillations. 
Even more serious objections to the linear model stem from the apparent 
ability of the cycle to restore itself following disturbances. Thus a delay 
of a few days in the onset of menstruation cannot be explained by a change 
in the constants, since such a change would affect subsequent periods as 
well, or perhaps disrupt them altogether. 
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More promise is held out by nonlinear systems such as occur, for ex- 
ample, in relaxation oscillations. The fluctuations in such a system are due 
to more or less abrupt passages from one steady state to another rather 
than to continuous periodic changes in concentration. 

Aside from all these qualitative considerations, however, it might be 
worth while to settle once for all the question raised above from a purely 
mathematical point of view, especially in view of the fact that the answer 
to such a question may find applications elsewhere. 

One can answer the question by deriving the necessary conditions for 
periodic solutions of linear systems under the restriction that the solutions 
(concentrations) be non-negative at all times, i-e., under the condition of 
positive steady states. Whenever a given system reveals an incompatibil- 
ity in these conditions, it must be discarded, i.e., a model based on it must 
be considered inadequate. Knowledge of some such necessary conditions 
would enable us to discard many models in some cases on the basis of 
comparatively simple calculations or by inspection. Such considerations 
are, of course, applicable'to any system purporting to describe a model 
where the rate of concentration of each substance is a linear function of the 
concentrations of all the substances in the system. Therefore, aside from 
the question of applicability of such systems to a theory of menstrual 
cycles, the problem has some general theoretical interest. 

The general case. The most general open linear system is represented by 
equation (1). Henceforth we shall refer to the aj) as the free constants and 
to the a,;(7 ¥ 0) as the coefficients. The general solution of such a system 
is in the form 


w= b+ >> Ayje* Gi=1,... 0), (4) 
ya 


where the 0; are the steady state values, the a; are the characteristic roots 
of the matrix (a;;),* and the A,,; are determined by the initial conditions, 
n of them being constants of integration. In order for the x; to be (ulti- 
mately) periodic and always non-negative, it is necessary and sufficient 
that 

1. the 0; are positive; 

2. there is at least one pair of pure imaginary a’s; 

3. the real parts of the remaining a’s are all negative. 


*Tn all subsequent discussion the case of multiple characteristic roots of the matrix (ai;) 
will be discarded. 
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That the conditions are necessary can be seen from the following: 

1. a vanishing or a negative 0; will imply a negative concentration at 
some time for some 4;; 

2. an absence of a pure imaginary pair of a’s will imply that all the 
solutions are sums of constants and/or transient terms (decaying exponen- 
tials) and/or terms with exponentially rising amplitudes; 

3. a positive real part of some a will imply ever increasing amplitude of 
oscillations, hence eventual fluctuations below zero. 

That the conditions are sufficient can be seen from the fact that the 
vanishing a’s will contribute only constants, while the negative real parts 
of the a’s will contribute only transient terms. If pairs of pure imaginary 
a’s occur, the solutions will be ultimately periodic about the (positive) 
steady states b;. The arbitrary A’s can then be chosen so small that the 
greatest occurring minimum will still be sufficiently close to the corre- 
sponding steady state to be non-negative. 

Case: n = 2. For two substances, we have in the most general case 


dx 
vxee Apt 1X + ayy, 
3 (5) 
= = doy) + AX + Any. 
Necessary conditions for periodicity are 
44; + dye = 0; reek Mee (6) 


The conditions for positive steady states are obtained by setting dx;/di = 0 
and solving the resulting algebraic system. The solutions are the 5; and 
must be positive. Hence 


| aa hig A192 211 — 0 
ig area iy ae ee Pee) (7) 
| a3; | ; | a: | 


Suppose conditions (6) are satisfied. Then, whether the a,; vanish or not, 
we can write a2 = —dy. Moreover since |a;;| > 0, inequalities (7) reduce 
to 

(@y9411 + A242) > 0, 


( — 441429 + Ge1d19) > 0. 


(8) 


We have proved 
Theorem 1. A linear open system has periodic solutions with positive steady 
states if, and only if, (6) and (8) hold. 
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That (6) and (8) can be simultaneously satisfied if the free constants do 
not both vanish can be seen from the following example: 


(9) 
dy _ ay 
oie y, 


which does have periodic solutions with non-negative steady states, for 


example, 
x=1-+sin/, (10) 


y=2-+sint—cos?. 

In the light of Theorem 1 we see that Lamport’s model failed because it 
had a non-vanishing trace (as he himself pointed out). A model with a 
vanishing trace implies that either one of the substances is “auto- 
catalytic” (i.e., its rate of production is enhanced by its own concentra- 
tion) or only “‘cross-effects”’ operate (the concentration of each affects only 
the rate of production or dissipation of the other) plus constants denoting 
independent rates of production or dissipation. We stress the rather ob- 
vious importance of the vanishing trace in the two-substance case, because 
this condition is mot necessary in cases where x > 2. Thus, as we shall see, 
Kesselman’s model failed for entirely different reasons. 

Case n = 3. The equation analogous to (3) for this case can be writ- 


ten as 
w —Tyw? +Tow —A=0, (11) 


where 7) is the trace of (a;;), i.e., the sum of its principal minors of order 
one, T; is the sum of the principal minors of order two, and A (which could 
be written as 7) is the “sum” of the principal minors of order three, i.e., 
the determinant of (a;;). Equation (11), being of odd degree, must have a 
real root, and by our conditions above, this root must not be positive. The 
other two roots must be pure imaginary conjugates. Hence the factored 
form of (11) must appear thus: 


(w+ 71) (o+ 72) =e + row’ + riot rin =0, (i2) 


where 7; 2 0 and, of course, r} = 0. Equating the coefficients of (11) and 
(12), we obtain the conditions of periodicity: 
TyS05 Ps 0 ees (13) 


It will now be shown that if A = 0, the case reduces to the case m = 2. 
Let A; be the Cramer determinants, obtained from A by replacing the ith 
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column by the constants —a;. Then, formally solving for the steady states 
by Cramer’s rule, we have 
A; 


b;= 
A 


Oa 7 Si, (on 8 
If A = 0, we must have each A; = 0 for finite steady states. This, how- 
ever, implies 

G39 = C1Aig+ C2d29 , 


(15) 
€3; = 6101; + C29; , 
and, consequently, 
dx; dx, dX» 
Tt = Wepre ers sr 
Integrating (16), we obtain 
Hg = 0141+ Co% 4+ 6. (ali) 


Therefore our third substance may be considered to be composed of con- 
stant fractions of the other two, plus a constant amount ¢o, which does not 
participate in the reactions. Renaming the substances 


t,=%,(1+ 4), (18) 
ty = %2(1+ 6), 


we obtain a system of essentially two interacting substances and an inert 
third one. But this is the case treated above. Hence the only non-degener- 
ate case for m = 3 is where 7% > 0, Le., 


fae 5 Le ANE EEO (19) 


Further examination of equations (11) and (12) shows that for periodic- 


ity we must also have 


To realize positive steady states, we must have, in addition, because of 
equation (14), the Cramer determinants of the same sign as |a;|. Hence 


AEA (21) 


Conditions (19), (20), and (21) now constitute the necessary and suffi- 
cient conditions for periodic solutions with positive steady states of a 
three-substance open linear system. Obviously only very special systems 
will satisfy all three conditions. The interesting question, however, is 
whether a system subject to certain restrictions on the coefficients and the 
free constants can be made to satisfy the conditions by fixing these 
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parameters within the limits of the restrictions. Suppose, for example, all 
the free constants a;) = 0. Then no value of the coefficients a; will insure 
positive steady states. Or suppose all but one of the free constants must 
vanish, and the signs of the coefficients are fixed. Then, depending on the 
distribution of these signs, conditions (19), (20), and (21) may or may not 
be compatible with respect to the system in question. It is questions such 
as these that we undertake to investigate here. 

Theorem 2. If no restrictions are placed on the free constants ajo, any system 
which satisfies conditions (19) and (20) can be made to satisfy (21) by giving 
proper values to the constants ajo. 

Proof. Setting dx;/dt = 0, we can consider the resulting system as three 
equations of three planes. If A # 0, the planes intersect at a point whose 
coordinates represent the three steady states. For positive steady states, 
this point must lie in the first octant. The direction cosines of the planes 
are determined only by the coefficients a,;;. Hence moving the planes by 
translation involves changes only in the constants aio. Thus by giving 
proper values to the ajo, the planes can be so moved, without changing 
their direction cosines, as to place the intersection in the first octant. 

Remark. Theorem 2 can be immediately generalized to any number of 
variables. 

The situation is different if restrictions are placed on the free constants. 
In particular, let it be required that one or two of these constants vanish. 
This means that the corresponding substances cannot be produced (or dis- 
sipated) independently. In our three-dimensional case, the vanishing of 
one dio means that one of our planes must go through the origin. If the 
a;; are fixed, the plane is completely determined. The translation of the 
intersection point of the three planes must, therefore, take place within 
that plane. If that plane has points in the first octant, a translation of the 
intersection into that octant is possible; otherwise not. Now a plane 
through the origin has points in the first octant if, and only if, there is a 
change of signs among its coefficients. We thus have 

Theorem 3. Let ai = 0. Then conditions (19), (20), and (21) are com- 
patible only if a change of sign occurs among the coefficients ay, a2, and ajz. 

If two of the constants aio vanish, the compatibility condition is more 
stringent. For in that case, our translation must take place along a line 
through the origin. The line goes through the first octant if, and only if, all 
three of its direction cosines are positive. These direction cosines are deter- 
mined by the two-rowed minors of the matrix, whose elements are the 
coefficients of the equations which determine the line. Now conditions 
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(19), (20), and (21) all involve two-rowed minors of the matrix a,;, hence 
they may be incompatible. To fix ideas, suppose our system is 

dx 

Gp Mot 1% + Grey + 132, 


d 
<r = dat + any + das 2, (22) 


az 
Gp = Ua 1 Gay + aa32. 


For such a system, a necessary condition for the compatibility of (19), 
(20), and (21) is given by 


Theorem 4. Let ary = a39 = 0. Then for compatibility of periodicity and 
positive steady state conditions, it is necessary that 


a3 a1 


pe Uesee C23.) 


@31 = A32 G32 33 Q33 33 


Proof. Under the conditions of the theorem, it is easy to show that (23) 
is equivalent to (21). 

We emphasize again that we are deriving necessary, not sufficient, con- 
ditions for compatibility. As we shall presently see, Kesselman’s model 
satisfied conditions (19) and (21), and, being of the type considered in 
Theorem 4, it satisfied (23). Yet, as we shall show, for this model (the signs 
of whose coefficients were fixed by the physiological assumptions) condi- 
tions (20) and (23) are incompatible. 

We are now in a position to examine Kesselman’s three-hormone model. 
It involves the follicle-stimulating hormone, an effective resultant of the 
luteinizing and the luteotrophic hormones, and estrogen, denoted by F, L, 
and E respectively. The equations 


a — AL + A3E , \ (24) 
d 
eS = 031 + agol — a33k 


reflect the following assumptions: 
1. F only can be produced independently ; 
2. £ inhibits F; 
3. E enhances L; 
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4. F and L enhance £; . 
5. all substances dissipate proportionately to their respective concen- 


trations. All a’s are positive. 
The matrix (a;;) corresponding to the system (24) is 


—a, 0 —aQ%; 
Oo — Qo2 Qo3 |. (2 5) 
@31 a32 — 433 


We see that the inequality T, < 0 of (19) is satisfied. Conditions (23) are 
tisfied if 
oreo 22033 — 423432 > 0 , (26) 
and this inequality also insures the inequalities T; > 0 and A < 0 of (23). 
Let us, however, look at condition (20). 
Expanding the determinant and the minors, we have 


— G41 (d2033 — A23432) = 


(27) 
— (G41 + G22 + ag) [11032 + (22433 — @o3432) + (@y1@33 + @1343;) |. 
Simplifying (27), we obtain 
= G41029 — G4 (441433 + G13G31) — (dee + G33) @11 22 (28) 


— (22+ a3) (22033 — e3A32) — (deg -+ G33) (411433 + G13431) = O. 


But (28) contradicts (26), since if (26) holds, all of the terms of (28) are 
negative. Hence Kesselman’s model fails. 

Remark. In this particular case we had to examine all the conditions of 
compatibility. However the inadequacy of a model may sometimes be- 
come apparent earlier, since only ove of such conditions need fail to reveal 
the inadequacy of the model. Suppose, for example, that estrogen inhib- 
ited instead of enhanced the luteinizing hormone. Such a situation would 
not qualitatively contradict the “push-pull” theory, because according to 
it the hormones of the pituitary still enhance that of the ovary, and the 
hormone of the ovary still has a negative back-action on the hormones of 
the pituitary. However under this assumption we should have in the 
matrix (25) —ds3 instead of a3, and inequalities (23) would be immedi- 
ately violated, since the third two-rowed minor would then be 


— Q93 0 
| = — 09303; < 0. (29) 


— Q3g_ 31 


We now pass to the conditions for compatibility of the four-substance 
model. 
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Case: n = 4, The condition of periodicity now becomes 
w4 —Ty3 + Tow? —T3w +A=0, (30) 
where the 7’s have meanings analogous to those in the previous case. 
Among the roots of this equation there must be a pair of pure imaginaries, 
the other two roots being complex with non-positive real parts (in particu- 


lar, they may be real non-positive or pure imaginary). At any rate, the 
factored form of (30) must be 


(w+ 7?) (w+ a-+ b;) (w+ a — B;) 


(31) 
=a 2tos 4 (ab? 7?) wo? + 2a re 4 7? (a? b?) = 0, 
where a = 0. Comparison with (30) shows that 
Us Os Do Ue Az 0. (32) 


Of interest is the case where A > 0. The condition analogous to (20) is 
TIDE s = Ts 4 AT i (33) 


If only one of the constants aio or three of these constants vanish, the con- 
ditions for compatibility are entirely analogous to the corresponding ones 
for n = 3. The intermediate case, where exactly two of the free constants 
vanish, is more involved. Suppose a@39 = dso = 0. For a non-degenerate 
case, we must have at least one non-vanishing two-rowed minor in equa- 


tions 


044% + Gy2%_ + 13%3 + AyuX, = 0, (34) 


1X1 + Aa2%q + A23%3 + dx, = 0. 
Let ad — @yd2, ~ 0. Then the hyperplane determined by (34) must 
have points with all positive coordinates, i.e., equations (34) must be satis- 
fied by some combination of positive «’s. Solving for «; and a in terms of 
x3 and x4, we have 


Q13 «42 a4 ay 
Qo3 = 22 Q24 29 
t= — V3 —%a ’ 
a4, Ay ay, Ay 
9 422 Qo; 22 
(35) 
ay 413 ay, ia 
Qo, 23 Qo, G4 
A eee ty 
Qj, Ap @, 2 
a1 O29 Qo a2 
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which can be written as 


(41, G22) 1+ (dis, Qo) X3+ (Aya, Qo) Xe= 0, (36) 


(Q11, G22) X2-+ (dy, G23) X3+ (An, Qo4) X4= 0. 
Here the parentheses are obvious abbreviations for the two-rowed determ- 
inants in (35). Equations (36) can be satisfied for some choice of positive 
«’s only if a change in sign occurs among the coefficients of each equation 
in (36). If this does occur, there are eight distinct cases symbolized as sign 
arrangements of the terms on the left side of (36) as follows: 


7 +b = oP a ee ae 
+t—' 4-45 -4+-5° == 4+ 
(37) 
ae ep 
+--+) F++4+—-' -=-+  -4+-— 


Cases I can always be satisfied by a proper choice of the x’s. Cases II can 
be satisfied only if certain inequalities hold. We shall examine one such 
sample case, namely, 

Ax,+ A3x3— Ayx,=0, 


AX, — B3x3 + Bux, = 0 : 


(38) 


where now the A’s and B’s are the absolute values of the coefficients in 
(36). Remembering that x, and x2 may be arbitrary as long as they are posi- 
tive, we write 

AsX%4— A3xzs>0, 


(39) 
— Buxs+B3x3>0, 
whence 
Bux, Ayx, 
B; <@iin < A; : (40) 
Since also x, is arbitrary, we must have 
By Ay 
B, Sars (41) 


These results can be summarized in 

Theorem 5. If two free constants in the case n = 4 vanish, the condition of 
compatibility is satisfied if a) there is a change in sign occurring among the 
coefficients of each equation in (36) and b) either a pair of coefficients of the 


same variable have the same sign or corresponding inequalities of type (41) 
are satisfied. 
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One of the four-hormone models offered by Kesselman involves pro- 
gesterone P, which enters the system in the following equation 


oF > asl —o4P, (42) 
which states that progesterone is enhanced by the luteinizing hormone and 
dissipates proportionately to its own concentration. No action of pro- 
gesterone on the other hormones is assumed here. Then the first three 
equations are independent of P and must satisfy the conditions of com- 
patibility separately. But we have seen that they do not do so. Hence also 
this model fails. If it is assumed that progesterone does affect the other 
hormones, corresponding terms must be added to the other equations, and 
the resulting system tested for compatibility in accordance with the 
criteria developed for the case m = 4. We note that if an inhibitory action 
is assumed by progesterone on the luteinizing hormone, as Kesselman 
suggests quoting C. D. Turner (1948), and if progesterone cannot be pro- 
duced independently, the conditions for positive steady states fail, as can 
be readily verified by examining the expanded matrix. Similar results ob- 
tain if progesterone is supposed to inhibit only the follicle-stimulating hor- 
mone or both of the pituitary hormones. 

Generalizations of the foregoing discussion to cases with a greater num- 
ber of variables is laborious but straightforward. This method may serve 
as a means of discovering incompatibilities in models proposed for cyclic 
phenomena based on linear open systems where restrictions (such as posi- 
tive concentrations) are imposed by the nature of the case. Timely dis- 
covery of such incompatibilities may serve to save a considerable amount 
of labor in the attempts to test the theoretical value of the model. 

This investigation is part of the work done under Contract No. AF 
19(122)-161 between the U.S. Air Force Cambridge Research Laboratories 
and the University of Chicago. 
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THE EXCITATION OF THE HEART AND ITS MODIFICATION 
UNDER THE INFLUENCE OF THE CHEMICAL MEDIATORS 
AND THE CARDIAC NERVES: I. SOME GENERAL CONSE- 
QUENCES OF THE ONE-FACTOR THEORY 
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A generalized form of the one-factor theory is applied to the heart. It is found 
that the effects of acetylcholine and sympathin on the chronaxie can be ac- 
counted for only if the decay constant of the excitatory state increases in the 
presence of acetylcholine, and decreases in the presence of sympathin. With the 
assumption that this is the only effect of the chemical mediators, it is shown that 
upon application of acetylcholine to the heart the theory predicts: 1) a rise in 
the rheobase; 2) an increase in the time required for excitation by constant cur- 
rents; 3) no change in the threshold with condensor discharges of brief duration; 
4) a decrease in the spontaneous heart rate; 5) a sudden and complete inhibition 
of the spontaneous heart rate with excessive concentrations of acetylcholine. In 
general, the effects of sympathin predicted by the theory oppose those of 
acetylcholine. 


During the last twenty years, various formal mathematical theories 
have been proposed and developed to account for the excitation and con- 
duction properties of nerve fibers (see Rashevsky, 1948). Since, in many 
instances, the excitation processes in nerve fibers run parallel to those in 
cardiac muscle, it is reasonable to suppose that the same formal theories 
will be useful in discussions of cardiac excitation. For example, H. D. 
Landahl and J. B. Kahn (1948) have been able to account for the effects of 
subthreshold currents on nerves in terms of Rashevsky’s two-factor the- 
ory. These results, together with the fact that the observed effects of sub- 
threshold currents on nerve and heart are similar (Gilson and Peugnet, 
1932), indicate that the two-factor theory may be able to account for 
these phenomena in the heart. 

The purpose of this paper will be to develop, with a minimum number 
of plausible assumptions, certain qualitative features of a generalized form 
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of Blair’s one-factor theory, which will be applied to the heart. This par- 
ticular theory has been selected because of its simplicity and because it can 
be regarded as a first approximation to the other theories. 

If a stimulus S(é) is applied to an excitable structure, then, according to 
what is, perhaps, the most general form of the one-factor theory, the 
change in the excitatory state ¢ of that structure will be given by 


SFIS] ke, (1) 
where & is a constant and F is some function of S and, consequently, a 
function of the time ¢. Excitation is to take place as soon as ¢ reaches a 
threshold h, which is constant. When the structure is stimulated by means 
of an electrical current, it has been assumed that F(S) = KJ, where I is 
the intensity of the current and K is a constant of proportionality. If a 
constant current is used and e(0) = 0, the following expressions for the 
chronaxie 7, rheobase R, and time é* required for excitation may be readily 
derived from (1) (Rashevsky, loc. cit.): 


2 
a6 


be (2) 
kh 
1 Vf 
oe ee ee 
1* => log. yy. (4) 


It has been demonstrated by numerous investigators (cf. Rosenblueth, 
1950) that the vagus and sympathetic nerves exert their effects on the 
heart primarily by releasing acetylcholine and sympathin respectively 
with each impulse. Since the qualitative results of experiments are inde- 
pendent of the method of application of the mediator (i.e., local applica- 
tion or nerve stimulation), we will hereafter refer only to the concentra- 
tion of the mediator. 

It has been observed that the chronaxie of the heart will decrease upon 
the application of acetylcholine and increase upon the application of 
sympathomimetic substances (Fredericq, 1928). If we assume that equa- 
tion (2) applies to the heart, then from the above empirical result it fol- 
lows that k must increase upon the application of acetylcholine and de- 
crease upon the application of sympathin. We shall assume that this is the 
only effect of the chemical mediators, and show that the one-factor theory 


qualitatively predicts the following events, each of which has been experi- 
mentally observed. | 
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1. The rheobase R increases in the presence of acetylcholine [observed by 
Ashman and Garrey (1931)]. This result is predicted by equation (3). 

2. The time t* increases in the presence of acetylcholine [observed by Ash- 
man and Garrey (loc. cit.)]. Differentiating ¢* with respect to & in (4), and 
using (3), leads to the expression 


aie h iF I 
Dhue ke (1 Reena oo oR. ©) 
The relations 
R 
LT eehie (6) 
and 
1 1 
ff 1 ekt* (7) 
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can be obtained readily from (4). Because of (4), (6), and (7), (5) can be 
written as 


CU alt (beng (3? t 
orale TA A). 3) 
Since for /* = 
ot* 
a= en) 9 
= Ok t*=0 , ( ) 
and for t* > 0 
1 
a kt* * = . 
it follows that 
ot* 
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and, therefore, ¢* will increase in the presence of acetylcholine. 

3. When excited by condenser discharges of brief duration, the threshold of 
the heart is not significantly altered by acetylcholine [observed by Gilson 
(1939)]. 

This finding can be accounted for as follows. For a condenser discharge, 
I may be written in the form 

1=Be*, (11) 


where B and 0} are constants. It will be shown that with a sufficiently large 
but fixed value of b, the smallest value of B that will excite is independent 
of k. This value of B may be obtained from the condition that the maxi- 
mum value of ¢ is #, that is, the condition that e = # and de/dt = 0 
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simultaneously at a time ¢ = ¢’. Inserting (11) into (1), and integrating 
with «(0) = 0, yields 
= KB —kt — p—bt (12) 
eo Res k (e é ) . 
As b increases, the time taken by the condenser to discharge decreases, and 
we naturally expect the time for excitation to decrease. Therefore, if we 
/ 
choose b large enough so that 6 > & and et’ ~ 1 — kt’, then, fort <./, 
(12) reduces to 
= AB ee). (13) 


Since at ¢ = t’, de/di = 0, (13) implies 


bet’ =k, (14) 
or ' ; 

‘== > Ns: 

t p Be |: (15) 


Inserting (15) into (13), with ¢ = ¢’ and e = h, we obtain the relation 
KB k Rk b 
San ees moe 16 
5 (1-5 Flom g) = ui 


Equation (16) may be rearranged to read 


Ke ae 
( nn es 2 
and, as 6 becomes larger, B approaches 

bh 


which is independent of k. 

4. The interval between heart beats T increases upon application of acetyl- 
choline and decreases upon application of sympathin [experimental evidence 
cited by Rosenblueth (loc. cit.)]. The fact that a heart beats spontaneously 
must be interpreted, in terms of the present theory, to mean that e is 
greater than zero (ie., € = h) at the beginning of each excitation. It fol- 
lows from equation (1) that in a spontaneously beating heart, regardless 
of the mechanism involved, F(S) cannot be identically zero, even when no 
external stimulus is applied. If, during the interval between beats, F[S(é)| 
is continuous and independent of the concentration of acetylcholine, then, 
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without any further assumptions about F, it will be shown that 
OT /dk > 0. 


First note that the general solution to (1) may be written as 
t 
c(t) =€(0) e+ ef FILS (0) ] eae, (19) 
0 


where ¢ = 0 at the time of the preceding beat. 

In equation (1), & refers only to the decay of e, and for this reason we 
will assume that & is independent of the type of stimulus involved. There- 
fore, the magnitude of & can be calculated from equation (2) if we know 
the value of r. The chronaxie of the dog’s auricle has been measured and 
found to range from 2 to 5 milliseconds (Brucke, 1930). This result and 
equation (2) yield a magnitude of k which is of the order of 200 sec.—. Be- 
cause hearts with a period of less than 0.2 sec. are rarely encountered, the 
term ¢(0)e—*‘ may be neglected in equation (19) at the time excitation 
occurs, leaving 


Oe et [FILS (8) lekde. (20) 
0 


The exponential e” is positive everywhere, and, therefore, the mean value 
theorem may be applied to the integral in (13), yielding 


aie eH P(S(E)1 f etae 
0 
Se (1 — e—*t) 


(21) 
SES] 
a k 
where 0 < & Si. 
At excitation, t = T, « = h, and (21) becomes 
AD) ope (22) 


Since F[S(é,)] depends on both ¢ and &, the notation may be changed by 
defining 

; FUS(é) 1 =F @, &). (23) 
The relation obtained from (22), 
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implicitly defines T as a function of k, and implicit differentiation of (24) 


yields 
Of 
wae =e 


= i (25) 


Ot /i=r 


To prove that d7/dk > 0, it will be sufficient to show that both the 
numerator and the denominator on the right-hand side of (25) are positive 
within the range of T. With this end in mind, we may write, by a compari- 
son of (20) and (21), 


t 
f(t, kt) = ke f PLS (8) ] d8, (26) 
0 
and, differentiating (26) with respect to k, we arrive at 
t 
= (1 — ft) ent ft EUS C633] cdot ke { OF(S (6) ] ede. (27) 
0 0 


Combining (20) and (22) yields 
h eh! "F S kod 
eres PTS (8 9 2 
e i [S (6) ] eda, (28) 
and it follows that 
Aig 
(53), = (1 aT) ht he f~ OF [S (6) ]e*do. (29) 


Let us assume, for the moment, that F[S(@)] = Ofor0 < @ < T. Then, 
an application of the mean value theorem to the integral in (29) leads to 
the expression 


0 T 
Gi ae (1 AD) htnke? f F[S (6) ] ed@ 


(30) 
= (1—kT)h+nkh, 
where 0 < 9 < T. Now n cannot be equal to 7, for if it were, then 
yi gs 
E FS (@ keg-O == “il 
J FIS(@) Le Jo OF{S (0) 1 eas, (31) 


and, differentiating both sides of (31) with respect to T, 


if F[S(6)] ed@+T7F[S(T)] et? =TF[S(T)] 7, (32) 


or 


J FIS(8) Leda =0. (33) 
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But the integral in (33) cannot be zero, since according to (28) it is equal 
to he. Then 0 < » < T, and from (30) the numerator N in (25) is given 
by 

N=h—-( =; =k(T—n)h>0O. (34) 

OR) ,=7 

Expression (34) was derived on the assumption that F[S(6)] was positive 
for0 < 6 < T. If F has negative values within this interval, the inequal- 
ity V > 0 holds a fortiori, since, according to (29), the contribution of 


5)» 


to (34) will be smaller. 
To prove the denominator, 


(3 
Ot/i=7' 


in (25) is greater than or equal to zero, first note that F[.S(@)] was assumed 
to be a continuous function of time (for 0 < 6 < T) and hence, according 
to (26), f(t, &) is a continuous function of time. Thus it is permissible to 
expand f(t, &) about the point T and arrive at 


fC dt) =f Cb) -(3 at. (35) 
lf 
af 
ee 
then 
EDs ee Te, (36) 


The time required for excitation T is determined by the smallest positive 
root of (24), i.e., the time at which f (é, 2) first reaches a constant threshold 
value which is given by kh. But, according to (36), f(¢, 2) must have sur- 
passed its threshold 2 at a time earlier than 7, and it follows that 


(3 22 (i). (3:4) 
Ot/i=r 
Comparison of (25), (34), and (37) completes the proof that 
oT 
as 38 
aR 9: (38) 


and thus it follows that the heart rate should be decreased by acetylcholine 
and increased by sympathin. 
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Inequality (37) imposes a restriction on F which may be useful in the 
future development of the theory. This restriction is seen more clearly if 
f(t, k) in equation (26) is differentiated with respect to #, for then 


= ae —kt : ko 39 
of 4 (FISO) Be uf FIS(8) le a6), (39) 
and using (37) and (28), we obtain 
FIS(L)] > ke*? [FILS (0) ] ed = kh. (40) 
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5. Excessive concentrations of acetylcholine are capable of completely stop- 
ping the spontaneous heartbeat (cf. Evans 1949). li F [S(¢)] is bounded, 
then F[S(é,)] is also bounded. Let the upper bound of F[S(£)] be given 
by ¢. If the concentration of acetylcholine causes k to increase monotoni- 
cally, and if the value of & surpasses ¢//, then, according to equation (22), 
the heart will stop beating. This is suggestive of the well-known experi- 
mental phenomena in which increasing the concentration of acetylcholine 
causes the heart rate to slow more and more, but when the concentration 
gets sufficiently large, the heart suddenly stops altogether. 

In the next paper, a particular case will be considered in which explicit 
assumptions will be made regarding the variation of F with time, and the 
variation of k with the concentrations of the chemical mediators. 

I would like to thank Dr. George Karreman and Mr. Clifford Patlak for 
reading and discussing this paper. 


LITERATURE 
Ashman, R. and W. E. Garrey. 1931. “The Excitability of the Turtle Auricle during Vagus 
Stimulation.”” Amer. Jour. Physiol. 98, 109-20. 
Blair, H. A. 1932. “On the Intensity Time Relations for Stimulation by Electric Currents.”’ 
Jour. Gen. Physiol., 15, 709-30. 
Brucke, E. Th. 1930. “Vergleichende Physiologie des Erregungsvorganges.”’ Ergeb. d. Biol., 
6, 327-425. 


Evans, C. L. 1949. Principles of Human Physiology. Tenth Edition. London: J. H. Churchill 
Ltd: 


Fredericq, H. 1928. “Chronaxie.”’ Physiol. Rev., 8, 501-44. 

Gilson, A. S., Jr. 1939. “The Increased Accommodation to Electric Currents Produced by 
Vagal Inhibition of the Turtle Atrium.” Amer. Jour. Physiol., 127, 333-37. 

and H. B. Peugnet. 1932. “The Effects Upon Cardiac Musculature of Subthreshold 
Electric Currents.”? Amer. Jour. Physiol., 100, 671-84. 

Landahl, H. D. and J. B. Kahn. 1948. “On the Effects of a Constant Subthreshold Con- 
ditioning Stimulus Upon the Response to a Constant Current Test Stimulus.” Bull. Math. 
Biophysics, 10, 91-95. 


Rashevsky, N. 1948. Mathematical Biophysics. Second Edition. Chicago: University of Chi- 
cago Press. 


Rosenblueth, A. 1950. The Transmission of Nerve Impulses at Neuroeffector Junctions and 
Peripheral Synapses. New York: John Wiley and Sons. 


BULLETIN OF 
MATHEMATICAL BIOPHYSICS 
VOLUME 14, 1952 


THE EFFECT OF ENVIRONMENTAL FACTORS ON THE 
RATES OF CULTURAL DEVELOPMENTS 


N. RASHEVSKY 
COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


Progress in social and cultural life is initiated by a small group of individuals, 
who do not follow the socially accepted habits, patterns of behavior, or ideas. 
Such individuals may be conveniently called nonconformists. The rate of cul- 
tural progress depends largely on their number. This number is partly deter- 
mined by purely biological factors, which result in a certain fraction of indi- 
viduals being born with the necessary predisposition. Partly, it is determined by 
the circumstance that the progeny of nonconformists is reared and educated in 
nonconformist behavior. This latter process requires the possibility for the 
existing nonconformists to gather together and to form semi-closed social groups. 
This, in turn, depends to a large extent on the rate of urbanization of the popu- 
lation, which is affected by various environmental factors. 

Mathematical expressions are developed which show how these different en- 
vironmental! factors affect the growth of the number of nonconformists, and 
thus affect the rate of cultural changes. Some very general comparisons with 
known facts indicate the possibility of future applications of the theory. 

The principal result of the study is the fact that social groups with identical 
biological characteristics may show a great difference in cultura] development 
due to differences in environment. 


It probably will be generally agreed that the essence of every cultural 
development and of progress consists of a gradual freeing of society from 
old unfounded beliefs, prejudices, and established habits and customs. 
The cultural importance of habits and customs has been emphasized by 
anthropologists (Benedict, 1950). Freeing society from the prejudice of 
superiority of some classes resulted in the abolition of slavery and marked 
an important milestone on the road of cultural development. The preju- 
dice of racial superiority still exists, though it may be lessening. The free- 
ing of society from the medieval prejudice that free thought and investi- 
gation were sins against the church led to a highiy increased tempo of de- 
velopment of the natural sciences. 

In two preceding papers (Rashevsky 1951b, c; hereinafter referred to 
as I and IL) we have discussed a mechanism by means of which society 
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gradually frees itself of strongly conditioned beliefs and prejudices. De- 
pending on the distribution in the society of certain psychobiological 
traits, a certain small fraction of individuals will overcome the early con- 
ditioning and social pressure which tend to preserve some established be- 
liefs. Those individuals are thus free from those particular beliefs, and 
with respect to them they do not behave like the majority of the society. 
For brevity, we shall call such individuals nonconformists. 

The mathematical expression for the fraction a of nonconformists (II) 
shows that a depends, among other things, on such purely biological fac- 
tors as the average age of the individual in a given society. The quantity 
a varies with time, but tends to an asymptotic value, which is almost 
reached within two or three generations. This asymptotic value of a is 
usually too small to have any appreciable effect on the behavior of the 
majority of the society. 

If, however, we assume that the nonconformists, or at least a fixed frac- 
tion of them, rear their progeny as nonconformists, then, as we have seen 
in I, the ratio ~/N of nonconformists to the total population tends to 1 
according to the expression [I, equation (25)] 


a a rer (1) 
where yp is the relative rate of increase of the total population WV. If the 


death rate is equal to v, then N varies exponentially according to 


N = Nye," (2) 
where 8 = uw — vp. 

The constant a is essentially a biological quantity. It represents approx- 
imately the fraction of individuals born in the society who possess psycho- 
biological predispositions which cause them to become nonconformists. 
The quantity is the total number of nonconformists, both the innate 
ones and those who become nonconformists as a result of their upbringing 
and education. Though a depends on the age at which a prejudice begins 
to be conditioned (Rashevsky, 1951c), nevertheless it does not vary a 
great deal, while 1 increases.according to (1). 

Let us consider somewhat more closely the derivation of equation (1). 
If a fraction a of the newborn individuals eventually becomes noncon- 
formists, and if all individuals born of nonconformist parents and reared 
as nonconformists permanently so remain, then we have: 


= Bn + au (N =n). (3) 
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Introducing expression (2) into (3), and integrating with the initial con- 

dition 
n = 0 fort = 0, (4) 
we obtain equation (1). 

The assumption which leads to (3) tacitly implies that both parents are 
nonconformists. It also implies that any nonconformist parents can rear 
their offspring as nonconformists, in spite of the social pressure of all other 
individuals. Clearly this is not realistic. If a is very small—say, ~ 0.01 or 
less—then, in a society with a small density of population and a uniform 
spatial distribution, the probability of a nonconformist marrying another 
nonconformist is also very small. Even if this happens, their children are 
exposed to the influence of the average members of the society, and rearing 
them as nonconformists becomes practically impossible. In order to satisfy 
the assumptions implied in the derivation of (1), two conditions are neces- 
sary. First, the population density must be large enough to bring a suffi- 
cient number of nonconformists within a sufficient spatial proximity to 
each other, even for a small a. Second, the absolute number of noncon- 
formists who do come together must be sufficiently large to form a more 
or less closed group, in which the children may be reared with a minimum 
disturbance from the rest of the society. 

If, as is usually the case, the individuals of a society are not distributed 
spatially uniformly, but divide into urban and rural populations, then it 
may well happen that while the conditions for (1) are not satisfied in the 
rural population, they are satisfied in the urban. If that is the case, then 
the following happens. Let V,, denote the total urban population. Of the 
aN nonconformist individuals born in the society as a whole, only aNV.,/N 
will be able to rear their progeny as nonconformists. This will have the 
same effect as if we substitute in equations (3) and (1) for a the value 


a lV 
Pees) 5 
Wo (@) 


Qe 


which we may call the effective a. With these simple assumptions, the larger 
N,/N is, in other words, the larger the urbanization, the larger the effec- 
tive a, and the faster the spread of nonconformism in the society. 

This picture is still too oversimplified. We shall now introduce several 
complications, one at a time, and thus arrive gradually at more realistic 
results. 

First, we observe that the ratio V,,/N is not constant but increases with 
N (Rashevsky, 1947, chap. x; 1951, chap. xviii). The form of the relation 
between W,,/N and N depends on the assumptions which we make about 
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the shift between rural and urban population. As an illustration only, we 
shall use here one of the simplest possible assumptions made before, the 
consequences of which lead to agreement with some observed data 
(Rashevsky, 1947, chap. x; 1951, chap. Xviil). 

As before, we shall assume that the equilibrium between rural and ur- 
ban population is determined by the equality of the production per capita. 
We consider that the rural production per capita is of the form 

ay 


a att De 6 
Ben eee? ( ) 


where a; and a, are constants, and NV, denotes the total rural population. 
The total urban production per capita is considered as constant. Putting 


Pipe —c?, (7) 
we obtain ty : 
C 
aol a Me, 8 
W 1 W? (8) 
or 
Nyi=N-—c?. (9) 


Let us consider the possible meanings of the constants a, de, and c?, and 
their dependence on other factors. 

If f denotes the fertility of the soil per unit area, that is, the maximum 
possible yield; if S denotes the total rural area; and if a denotes the maxi- 
mum area which one individual can cultivate, then for very small values 
of V,, when everyone has much more land available than he can cultivate, 
we have roughly 

p,=af. (10) 


When NV, > S/a, the area available for each individual is S/N, and we 
have 


ye (11) 
These two limiting cases are obtained from (6) if we put 
a= S/f% a=". (12) 


We explicitly neglect in this crude zeroth approximation the diversifi- 
cation of rural activities, such as agriculture, cattle breeding, etc. 

As regards p,, it may depend on the following factors. First, we notice 
that the less the fertility f, the more inducement an individual finds to 
move to the city. With smaller f, the rural population has to devote all of 
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its time and effort to agricultural pursuits and cannot take care of other 
needs, such as production of clothing, building materials, etc. More of 
these items are produced in the city. The existence of neighboring coun- 
tries with high agricultural production will tend to increase p,. The reason 
for this is that such countries are likely to be good markets for items pro- 
duced in the cities, Climatic conditions also affect p,. Thus in a very warm 
climate, where production of clothing and shelter is a secondary activity 
and is not essential for life, there is no need for manufacturing enterprises, 
which are located in cities. Larger available mineral resources also increase 
pu, Since mining and metallurgical activities are definitely city occupa- 
tions. As a crude approximation we shall assume all the relations to be 
linear within a certain range. Denoting by c, ¢2, cs, c4, and c; five positive 
constants, by NV, the population of neighboring agricultural countries, by 
7 the average yearly temperature of the region, and by M the mineral re- 
sources, we may put 


pu= 61— C2of + 63Na— Cat + 65M . (13) 
From (7), (12), and (13) we find 


Sf S 


eee ——, 
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(14) 


Assuming an exponential increase of the population, we obtain, by intro- 
ducing (2) into (8) and the latter into (5): 


asa (1-3 ec). (15) 
0 


Thus a. < a, but tends asymptotically to the “biological” value a. 

Now, however, a. is a function of time. Substitution of a, for a in equa- 
tion (3) now changes the form of the equation, and we do not obtain ex- 
pression (1) now upon integration. The differential equation (3) still re- 
mains linear and can be solved by standard methods. The solution, how- 
ever, leads to integrals of the form: 


Fm { — grant (act/BNy) Ae) (16) 


N 
It is readily seen that expression (15) leads to a more decelerated rate 
of increase of nonconformists than that given by (1). In line with what we 
said in the introductory pages, we may say that expression (15) leads to 
a lower speed of cultural development. 
Although the assumptions made here are very inaccurate and were used 
only as an illustration of the method, it may be worth discussing equations 
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(14) and (15). As we see, c? increases with S, f, 7, and a, and decreases with 
N, and M. Hence for the same span of development time we would expect 
very large countries with fertile soil to be more retarded than others. A 
large Ny means a larger average a., but, if 8 is the same, it means at any 
time a greater population NV. Hence we should expect a retardation in 
countries with very large areas. A warm climate also reduces a,. Larger 
mineral resources increase a.. We actually see that the small, highly ur- 
banized city states of Greece exhibited a very rapid cultural development. 
The Western European countries, smaller both in area and population 
than Russia, showed a much more rapid progress both in socio-political in- 
stitutions and in technology than did Russia. A comparison of Western 
Europe with China and India also shows the same trend. Factors which 
affect a, in opposite directions may affect each other. The Eskimos are 
characterized by small No, very small 7, and negligible f. But then V, and 
M are also near zero. The obviously small value of their a, need not, 
therefore, be surprising. 

We now shall introduce the next complication. Equation (5) is obtained 
under the assumption that in the cities both the requirement of the neces- 
sary minimum density of the nonconformists and the requirement of the 
necessary minimum number to form a semi-closed group are satisfied. 

The first assumption is more likely to be realistic. While cities vary in 
their population density, this variation is not as large as the variation in 
population density between city and country. The discussion of the pos- 
sible effect of the density of city population requires a further elaboration 
of the theory of formation of cities, and for the present we shall not con- 
sider it. 

The second assumption can be readily generalized at this time. Let the 
necessary minimum number of nonconformists be g. This situation is quite 
likely: although for a particular society the total urban population 
N,, > g, the number of cities is so large that in each city the population 
is less than g. Consider first the unrealistic case in which all cities are of 
the same average size n,. Then a, is different from zero if, and only if, 


any,> g. (17) 


The problem of the average size of cities is closely connected with the 
problem of distribution of city sizes, and an approach to it has been sug- 
gested elsewhere (Rashevsky, 1947, chaps. xi-xii). Here, again as an illus- 
tration, we shall consider a much cruder approach to the problem. 

Since there must be a constant interchange of goods between city and 
country, the number m of cities is roughly determined by the area s from 
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which supplies can still be received. In a very large country if all the urban 
population should be concentrated in one city, it could not exchange the 
necessary goods with the whole rural population. The size of the area s 
depends on many factors, such as roads, rivers, etc. The area s increases 
with increasing improvement in methods of communication, and is, there- 
fore, itself a function of the technological development of the country. For 
the present we shall consider it, however, as constant. The area s may de- 
pend also on the specific seashore line 1, of the country, that is, the ratio of 
the length of the seashore line to the area. The larger /,, the more cities 
will be located on the seashore. Sea communication in the past has always 
been easier than land communication. Greater distances from the city can 
be more easily reached when /, is large. We may put 


sa se EB, (18) 
We have approximately 
m=", (19) 
Ss 
and 
a du oN aS 
From (17) and (20) we find 
aNys 
<> 21 
wets (21) 
or P 
oe A 
NuS ay: (22) 
From (2) and (9) we have 
Ne= Noe’ — C. (23) 


Setting the right side of (23) equal to the right side of (22) and solving 
for t, we find the moment ¢* at which (22) begins to be satisfied. We thus 


obtain 


2 
1 = Flog (Eo + 5 (24) 

Prior to ¢* we have a, = 0, and the number of nonconformists remains 
constant and small, equal to aN. This does not necessarily mean any lack 
of progress of culture with time, but progress will be very small due to in- 
dividual efforts of single nonconformists. Development of arts can proceed 
under those conditions much more readily than development of science 
and technology (Rashevsky, 1947, p. 178). For ¢> ¢* the situation 
changes. Now a, is given by expression (15), and the number of noncon- 


formists increases. 
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Equation (24), together with (14), shows that ¢* increases with S, 7, and 
f, and decreases with Vo, N., and M. Thus the same factors which increase 
or decrease the value of a. correspondingly shorten or lengthen the initial 
time lag ¢* of intensive development. This of course enhances the over-all 
effect of those factors. 

As the next step we shall consider the fact that the cities are not all of 
the same size. As we said above, a general approach to the problem of the 
distribution of sizes of cities has been suggested elsewhere (Rashevsky, 
1947). Let us see what effect such a distribution will have on a, and /*. 

Let m(n,,)dn,, denote the number of cities with populations between n, 
and m, + dn,. We have 
fom (Ny) dny= mM. (AS) 


0 


Only in those cities will groups of nonconformists be formed for which 
@My > £5, OF te (26) 


The total number of nonconformists in such groups is 


Janam (ny) dn, . (27) 
g a 
Hence 
a foo} 
aaa f num (Ny) diy . (28) 
g/@ 


Let us consider again, as an illustration only, the following example. 
Let m(n.) be of the form: 
Pea es 0) 
mM (Ny) ae e : (29) 
where 7, is, as before, the average size of the city. From (20) and (29) we 
find 


mS gS! Nys)y 


M (Ny) = as (30) 
Introducing (30) into (28) and making use of (19), we find 
ie a al Vins Neesee ane 
re +e) eS Nae (31) 


For V, = 0, a, = 0. We see that a, again increases with WV, hence it 
again decreases with c’. The effect of the environmental factors is qualita- 
tively the same as before but quantitatively more complex. Moreover the 
new factor s is introduced, and, because of (18), this introduces J,. We see 
that a, increases with 7,. We may notice that Western Europe has the 
largest /, of all other continental parts of the world. 
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If m» is the population of the largest city, then we may say that the 
number of cities whose population lies between 7» and « is 1. This defines 
Mm as the root of the equation 


ih m (n,) dn,=1. (32) 


am 


Introducing (30) into (32) we find 


ives. 8 
ine log ~ (33) 
The value of a, will remain zero as long as 
Giles, or tn< =. (34) 
a 


Introducing (23) into (33), equating the latter to g/a, and solving for 
i, we find the time lag ¢* 


1 Sg Ge 
ee e : 
ace [log Noa s log (S/s) ale oe) 


The relation is similar to (24). 

Since the formation of groups of nonconformists begins in the largest 
city, cultural developments will spread also from the largest cities. The 
smaller ‘‘provincial”’ towns and the country are affected later. This con- 
clusion seems to be borne out by history. 

Because of the crude approximations and oversimplifications used 
above, we cannot draw any definite conclusions from the paper which 
could be subject to direct verification. The importance of the above re- 
sults lies, however, in the fact that they show the possibility of large dis- 
parities of rates of cultural developments of peoples whose biological char- 
acteristics are identical. This does not prove, of course, that all the ob- 
served cultural disparities are of an environmental nature. It shows, how- 
ever, that environment may be a sufficient cause of those disparities. 

The author is indebted to Dr. George Karreman for a critical discussion 
of the paper. 
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The probability problems connected with random nets are restated as proba- 
bilities of drawings from an urn containing black and white balls. A partial 
difference equation is obtained and its solution is given. For large nets a series 
expression is obtained for the connectivity y, and this is shown to be equivalent 
to the transcendental equation obtained by R. Solomonoff and A. Rapoport 
(1951). 


Introduction. A random net has been defined by Rapoport as a set of 
n points, each of which is connected by directed line segments to a points 
of the set chosen at random. The weak connectivity y is then the expected 
fraction of the points which can be reached by tracing along the line seg- 
ments starting from a randomly chosen initial point. Solomonoff and 
Rapoport (Joc. cit.) have given various interpretations of this model, and 
also an approximate solution for y which is equivalent to the one derived 
below when z is large. Solomonoff (1952) has also given an exact method 
using Markov chains, but the required calculations are very involved and 
do not lead to an explicit expression for y. 

We restate the problem as an urn problem. An urn contains x balls, w 
being white and »—w black. There is a player who has s tickets. He pays 
one ticket for the right to draw a ball at random from the urn. If the ball 
drawn is white, he receives a additional tickets, if black he receives 
nothing. The ball drawn is always replaced by a black ball. Drawings 
continue until s = 0. 

The random net interpretation of this game is that black balls represent 
points already reached in the tracing process, white balls points not pre- 
viously reached, and the tickets are the number of lines, emanating from 
previously reached points, which have not yet been traced.* 

* In the random net interpretation we may want to exclude the possibility of a line from 


any point directly back to itself. This simply requires that we omit one black ball from the 
urn; the omitted black ball then corresponds to the point from which lines are being traced. 
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Now let E(w, s) be the expected number of white balls at the end of the 
drawings, when the drawings started with w white balls and s tickets. Then 
the weak connectivity is 

E(n, 1) 
yal See (1) 
Also, let p(w, s, #) be the probability that the drawings end with w white 
balls, when they started with w white balls and s tickets. Then the strong 
connectivity, or probability of no white balls remaining from a start with 
all white balls and one ticket, is p(, 1, 0). 
The expected value and probabilities are related by 


E(w, s) = SS up(w, s, 0), (2) 


u=0 
since 
p(w, s,u) =0 for u>w. (3) 


Equation (3) states that the number of white balls cannot increase 
during the drawings. 

Difference equations. We now derive the difference equations for the 
p(w, s, u) by considering the possible outcomes of successive draws. In 
order to have u = w, i.e, no change in number of white balls, only black 
balls must be drawn, so that 


yh) 
p(w, s,w) =(1-") p(w, s1,w), (4) 
since 1 — w/n is the probability of drawing a black ball. For u < w, 
p(w sw) =(1-2) pw, s—1, 0) +(2) pw —1, s— 144, w). (5) 


This equation states that the probability of ending with u white balls is 
equal to the probability of a black ball on the first draw, multiplied by the 
probability of ending with « white from a start with w white and one less 
ticket, plus the probability of white on the first draw, multiplied by the 
probability of ending with « white from a start with w — 1 white and 
a — 1 additional tickets. 

In (4) and (5) we must have s > 0. For s = 0, 


£(D;.0,.u) =O, for UAW, 
(6) 


=1, for u=w. 
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The values of p(w, s, «) for w = 0, as follow from (4), (5), and (6), are 


p(0, s,u) =0, for ie Ue 
(7) 


=1, for w= 0. 


We are mainly interested in the connectivity, so we now derive the 
difference equation for the expected number of white balls. From (2), 


(4), and (5), 


w 


E(w, s) = Dd up (w, s, u) =(1-2)>> up (w, s—1, u) 
u=0 


u=0 
20 
+) du w- 1,s—1+a,u), 


or 


ns) -(1-2)£q, et) +(2)Ew=-1, Seach ~ EE 


The boundary values for this equation are, from (6) and (7), 
Ew, 0) =w, (9) 
E(0, s) =0. (10) 
Solution of difference equation. The difference equations for p and £, 
(5) and (8), are of the same form, so we give the solution for E(w, s). This 
can be arrived at by fixing w and solving the ordinary difference equations 
in s for E(1, s), E(2,s),....The form of the first few solutions suggests 
a general form, (16), for E(w, s), containing undetermined coefficients. The 
validity of this general form and the equations determining the co- 
efficients can then be obtained by standard methods for ordinary differ- 


ence equations (Milne-Thompson, 1933). 
From (8), (9), and (10) 


E(A, s)=(4-2)£U, 5-0), Eno eat (11) 


so that we have immediately 


E(1, )=(1-2). (12) 


Putting this value in the equation for E(2, s), 


E(2, s) =(1-=)£Q, Seeat +(2)-2)". (13) 
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The solution of this equation may be written as 


E(2,s)= A:(1-2) +2(1-2) 


where A» is a constant to be determined from E(2, 0) = 2, so that 


Bs) 22 (1-2) 42 (1-1-4) ]Jr-<y. (14) 


Similarly, 


s+2a 1\2 2\sta 
E(3, 8) =3(1-3)"" +6 [1-1-5 ) (1-3) 
ae 
1\24 1\¢ VIN 8 
+3 {1-(1 eG eee 
This suggests that we take as the general form for E(w, s) 


ae 4 s+(w—i)a 
E(w, y= Dee (1-5) (16) 


where the c; are independent of s. When we insert this expression for 
E(w — 1, s — 1+) in (8), and solve the resulting difference equation 
in Ss, 


8 wal i \ s+ (w-i)a 
W Ww 4 
E(w, = de (19) epee 1-+) A 


Then (9) gives 


wT . . 
w 4\@—i)a 
Beeb Site 


Now we can see that if we put cj = b; (*) , 6; will be independent of w, and 
: 4 L 
the solution can be written 


RCRA Da 5; ) (1 i (17) 


with 


a I \ (wi )a : 
w= a. (7) (1-2) CO) 5g 1d seen ae 


i=1 


This can, of course, be verified by substitution in (8) and (9). 
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The 6; depend on u and a only. The first three values, as can be ob- 
tained from (18) or from (12), (14), and (15), are 


b= As 
b= 2 | -(1-3)'], (19) 


eee inca 


To obtain an explicit expression for E(w, s), we can solve the equations 


(18) in succession for b;, bo, . . . , 6) and then substitute in (17), or we can 
express E(w, s) directly by writing the solution of the w + 1 linear equa- 
tions (18) and (17) for by, bo, ..., bv, E(w, s). The determinant of this 


set of equations is readily seen to be —1 since the coefficient of }; in the 
ith equation of (18) is 1, while the b’s with higher index all have zero 
coefficients in this equation. 

We are interested mainly in E(m, 1) which gives y by (1). Writing 


n= (1-4), (20) 


and noticing that 0, has zero coefficient in (17), we have 
E(n, 1) - Ss. & )s wi (1-+)= 7, (ee aes es 
t=1 


This equation with the 7 — 1 equations of (18) gives the following de- 
terminant for E(n, 1): 


—E(n, 1) = 

i 0 0 =~ 0 1 

Ox 1 0 EG 0 2 

3x2 ey) a 1 ab) Cas 
et pe a oe meen ee ny a a ke ale (22) 
tp —=2) 2% * (Ga an are ae a | 0. #2=-2 
(ql) sn" (ee 4) x8 (oe :) ot en ee n—1 

(n— ipa? ("> *) ar? ("5 {)t — G1) sls m1 0 
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For example, either from this determinant or (19) and (21) 
3\3a 3 a a) 
z(4,1)=3(G) +6[1-G) |G 
3 2a 3 a yp a (=) 

+341-G)-2[1-G)]G)4G@)- 

and, more generally, the first three terms of E(m, 1) are 
1\ (he 
E(n, 1) = (n—1) [(1 -=) 
1\2 2\ (n—-2)a 
+ (n—2) [1-(1--) lG-3) 


i (i 2) eas) fee cael (23) 
2 n 


1\e 2\a 3\(n—3)a 
~2[1-(1-3) ]G-3) ((1-=) +...]. 
Modified equation for coefficients, b;. The expression for E(m, 1) becomes 
very complicated as m increases. For a = 1, it is possible to obtain a 
simple expression directly (Rapoport, 1948), which is shown below to be 
a special case of (21). For a > 1, we consider the limit as 7 > o, since 
this is the case of principal interest. 
For these purposes we first obtain a modified form of equation (18) for 
the b;, which is given in (26) below. 
In (18), multiply each equation up to w = v (v < m) by 


w 
and add, then, 


$= (-> (w= De ey 5. (®) ak (24) 


Now 


2 eo) =1, for v=1, 
w=1 w 


= 0, for ie 21 


as can be seen by noting that this sum is (—1)*A’0. Here we are using 
difference notation and this is a special case of the formula (Milne- 
Thompson, loc. cit.) 


Amir = ss (<p (ator, (25) 
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In the double sum on the right in (24), we interchange the order of sum- 
mation, and obtain 


= SS >> (ae Dai? (>) (ex) ni 


i=1 


=n (ys 5 1 (*5*) « 


i=1 


= (- (7) b.(1—x,)-%, 


t=1 


We finally have, writing w for 2, 


. = w pe \ ee 
oe a i (G)a-ayet=t, w=, (26) 
== OMG ye Oyo 


Now for a = 1, 1 — x; = 1/n, and from this it readily follows that 


b= (4) (27) 


by induction, using (26) and (25). 
Using (27) to evaluate y for a = 1, we obtain after some manipulation 


(1 —1)!% nk 
y= (28) 
n — k! 


which agrees with the result obtained by Rapoport (1948). 
The limit asn > ~, fora > 1. We will first obtain the initial term for 


the expansion of b; in powers of 1/. From (26), 6; is a polynomial in 1 /n. 
Let 
bis id, eee (29) 


We show that 
b. ,=0, for k<i-1, (30) 
(31) 
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From (19) it can be seen that this is true forz = 1, 2, 3. Assuming then 
thatsore mela cena eel > 
ta c=" F 
b= (FE) +aF 


where A; is a polynomial in 1/n, and ae 


1—#,=1-(1- £y' = =24%, 
n 


where y; is also a polynomial in 1/n, we have from (26) 


b= > (—1)°H- xe: | (2 oy + : Fae 
SS OOr oe 


where 7; is a polynomial in 1/ which starts with a term in 1/n”. This 
proves (30). Also 


w—1 w 
fo a wtt—1 rw—1 
ba a =e S ( 1) (°)i ’ 
i=1 


but, using (25), 


w—1 


a Co eae 1G Jie —wi=0, 


i=1 
so 
baer a) 


proving (31). 
We can now obtain an expression for y for large . In 


E(n, 1) — b; n—1 
ya 1S 21 SY ( Jam, (32) 


we have, as 7 > ~ from (31), 


b; (ta) +71 
S— TC — 
n ni ; 


ya 
i nl 


Here f(m) ~ g(n) means f(n)/g(n) > 1 asn— @. Hence 


ECU) ~ Se uo 


and also 
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F 1 (n—1)a 
Pee ee. 
n 


This indicates that as m > o, we have 


Also 


A ad ear Sa Aa (33) 


By examining the ratio of successive terms, it is easily shown that this 
series converges when ae~* < e—1, that is, when a > 1. 

This is not a complete proof, because while we have shown that each 
term in (32) approaches the corresponding term of (33), it does not follow 
that the sum in (32) has for its limit the series in (33). A direct proof 
seems to present difficulties so we approach the problem from a different 
direction. 

Solomonoff and Rapoport (Joc. cit.) have given the following equation 


for y: 
eee, (34) 


which is valid for large m (Rapoport, 1951). This can be converted into a 
series for y in terms of a by using Lagrange’s formula (Goursat, 1927). 
Lagrange’s formula gives for 


y=atxo(y) 
the following series in x: 


eke Gol Gps : 
y=at ee aes) 


To apply this to (34), we make a change of variables 


y =a (1 — 7), 

x=ae’"; 
then (34) becomes, 

y = XY e¥ 5 


and applying Lagrange’s formula gives 


i=1 


Replacing x and y by a and y we obtain (33) again. 
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Strong connectivity. The probabilities p(w, s, w) can be calculated by the 
same method as was used for the expected value. We will not give details 
but merely state the results for p(w, s, 0): 


w w 4\8t wie 
pw, s, 0) =1— ee (Z) (1-5) 
with 
w i \ (w—t)a 
ds (8) 4-4) =1, (35) 
i=1 
and 


im hg=(—1)*". 


It can be shown that the strong connectivity p(m, 1, 0) approaches zero 
asn— o, 

This work was aided by a grant from the Dr. Wallace C. and Clara A. 
Abbott Memorial Fund of the University of Chicago. 
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